Abstract

In this work we show that, for any fixed $d$, random $d$-regular graphs asymptotically
almost surely can be coloured with $k$ colours, where $k$ is the smallest integer satisfying
$d < 2(k-1)\log(k-1)$. From previous lower bounds due to Molloy and Reed, this establishes
the chromatic number to be asymptotically almost surely $k-1$ or $k$. If moreover
$d > (2k-3)\log(k-1)$, then the value $k-1$ is discarded and thus the chromatic number
is exactly determined. Hence we improve a recently announced result by Achlioptas and
Moore in which the chromatic number was allowed to take the value $k+1$. Our proof
applies the small subgraph conditioning method to the number of balanced $k$-colourings,
where a colouring is balanced if the number of vertices of each colour is equal.

1 Introduction 

The chromatic number $\chi$ of random graphs is a topic that has attracted considerable interest
since the breakthrough achieved by Shamir and Spencer [18], which marked one of the first
applications of martingales in combinatorics. For the classical Erdős-Rényi model $\mathcal{G}(n,p)$, a
celebrated result by Bollobás [7], later extended by Łuczak [14], showed that if $pn \to \infty$ then
asymptotically almost surely (a.a.s.)

\[ \chi(\mathcal{G}(n,p)) \sim \frac{n\log(1/(1-p))}{2\log(pn)}. \]

Here and in similar statements, an event occurs a.a.s. if its probability tends to 1 as $n$ tends
to infinity. For $p = c/n$, Achlioptas and Naor [3] proved that the chromatic number of $\mathcal{G}(n,p)$
is a.a.s. $k$ or $k+1$, where $k$ is the smallest positive integer with $2k\log k > c$. Moreover, they
discarded the case $k$ for roughly half of the values of $c$. In the same direction, Coja-Oghlan,
Panagiotou and Steger [8] showed that a.a.s. $\chi(\mathcal{G}(n,p)) \in \{k, k+1, k+2\}$ for $p < n^{-3/4-\epsilon}$,
where $k$ is the smallest positive integer satisfying $2k\log k > p(n-1)$. Meanwhile, some
other results gave concentration of the chromatic number without determining the values so
precisely: Łuczak [15] proved that $\chi(\mathcal{G}(n,p))$ is a.a.s. 2-point-concentrated if $p < n^{-5/6-\epsilon}$, and
later Alon and Krivelevich [4] extended this to $p < n^{-1/2-\epsilon}$.

More recently, results have been published about the chromatic number for the model
$\mathcal{G}_{n,d}$ of random $d$-regular graphs, which is the probability space on $d$-regular graphs with $n$
vertices having uniform distribution. For basic results and notation on random regular graphs,
see [21]. Hereinafter, $dn$ is always assumed to be even for feasibility. For fixed $d$, Molloy and
Reed [16] showed that if $q(1-1/q)^{d/2} < 1$ then $\chi(\mathcal{G}_{n,d}) > q$ a.a.s. Then, for $d < n^{1/3-\epsilon}$, Frieze
and Łuczak [11] established that a.a.s.

\[ \chi(\mathcal{G}_{n,d}) = \frac{d}{2\log d} + O\!\left(\frac{d\log\log d}{\log^2 d}\right), \]

and later Cooper, Frieze, Reed and Riordan [9] extended the same asymptotic formula to
apply to $d < n^{1-\epsilon}$. Similarly, the range $n^{6/7+\epsilon} < d < 0.9n$ was covered by Krivelevich,
Sudakov, Vu and Wormald [13], who showed that $\chi(\mathcal{G}_{n,d}) \sim \frac{n}{2\log_b d}$ a.a.s., where $b = n/(n-d)$.
Achlioptas and Moore [2] recently announced a significant new result for constant $d$. They
stated that if $k$ is the smallest integer satisfying $d < 2(k-1)\log(k-1)$ then a.a.s. $\chi(\mathcal{G}_{n,d})$
is $k-1$, $k$, or $k+1$. If, in addition, $d > (2k-3)\log(k-1)$, then a.a.s. $\chi(\mathcal{G}_{n,d})$ is $k$ or
$k+1$. Finally, Ben-Shimon and Krivelevich [5] established 2-point concentration of $\chi(\mathcal{G}_{n,d})$
for $d = o(n^{1/5})$.

In this paper we restrict the set of possible values for the chromatic number given by
Achlioptas and Moore, and show that $\chi(\mathcal{G}_{n,d})$ a.a.s. cannot be $k+1$. Therefore this reduces
the range of possibilities for $\chi(\mathcal{G}_{n,d})$ to only a.a.s. $k-1$ and $k$, in the first case, and establishes
that $\chi(\mathcal{G}_{n,d}) = k$ a.a.s. in the second case. In particular it provides an alternate proof of the
results of Shi and Wormald [19, 20] that a.a.s. $\chi(\mathcal{G}_{n,4}) = 3$ and $\chi(\mathcal{G}_{n,6}) = 4$. It also establishes,
for example, the previously-unknown result that a.a.s. $\chi(\mathcal{G}_{n,10}) = 5$ and $\chi(\mathcal{G}_{n,10^6}) = 46523$. We
essentially need to show that $\mathcal{G}_{n,d}$ is a.a.s. $k$-colourable, since the abovementioned lower bound
of Molloy and Reed implies that $\mathcal{G}_{n,d}$ is a.a.s. not $(k-2)$-colourable, and for the second case,
not $(k-1)$-colourable. Our basic approach for the upper bound is similar to that of Achlioptas
and Moore, in that we analyse the second moment of the number $Y$ of balanced $k$-colourings
of random regular graphs. In fact, Achlioptas and Moore found that the second central
moment $E(Y - EY)^2$ is (essentially, in a more or less equivalent model of random graphs)
asymptotically a non-zero constant times the square of the first moment. Consequently, Chebyshev's
inequality fails to show the result which we claim above. In cases like this, the failure of the second
moment inequality to establish $Y > 0$ a.a.s. can, at least for random structures similar to
$\mathcal{G}_{n,d}$, be overcome by using the small subgraph conditioning method of Robinson and the third
author. (See [12, Chapter 9] and [21] for a full exposition of the method.) Using this, we
show that $\mathcal{G}_{n,d}$ is a.a.s. $k$-colourable.

Theorem 1 Given any integer $d \ge 3$, let $k$ be the smallest integer such that
$d < 2(k-1)\log(k-1)$. Then the chromatic number of $\mathcal{G}_{n,d}$ is a.a.s. $k-1$ or $k$. If furthermore
$d > (2k-3)\log(k-1)$, then the chromatic number of $\mathcal{G}_{n,d}$ is a.a.s. $k$.
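
The two inequalities in Theorem 1 are easy to evaluate numerically. The following is a minimal sketch, not part of the original paper: the function name `chromatic_range` is hypothetical, and the code simply searches for the smallest $k$ with $d < 2(k-1)\log(k-1)$ and then tests whether $d > (2k-3)\log(k-1)$.

```python
import math


def chromatic_range(d: int):
    """Return (k, determined) for a fixed degree d >= 3.

    k is the smallest integer with d < 2(k-1)log(k-1) (natural logarithm).
    determined is True when additionally d > (2k-3)log(k-1), in which case
    Theorem 1 gives chromatic number k a.a.s.; otherwise it is k-1 or k.
    """
    if d < 3:
        raise ValueError("Theorem 1 is stated for d >= 3")
    k = 2  # log(k-1) = 0 here, so the loop always advances at least once
    while not d < 2 * (k - 1) * math.log(k - 1):
        k += 1
    determined = d > (2 * k - 3) * math.log(k - 1)
    return k, determined


if __name__ == "__main__":
    for d in (6, 10, 10**6):
        print(d, chromatic_range(d))
```

For $d = 10$ and $d = 10^6$ this reproduces the values $5$ and $46523$ quoted in the introduction.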

Actually, almost all previous applications of the small subgraph conditioning method were
for a random variable that counted large subgraphs in the random graph. To apply the
method in this setting we need to calculate the first and second moments of the number of
balanced $k$-colourings ($n$ is then required to be divisible by $k$), as well as joint moments of the
number of such colourings and the number of short cycles. These computations are done in
the well-known pairing or configuration model $\mathcal{P}_{n,d}$, which was first introduced by Bollobás [6].
A pairing in $\mathcal{P}_{n,d}$ is a perfect matching on a set of $dn$ points which are grouped into $n$ cells
of $d$ points each. A random pairing corresponds in a natural way to a random
$d$-regular multigraph (possibly containing loops or multiple edges), in which each cell becomes
a vertex. Colourings of the multigraph then correspond to assignments of colours to the cells
of the model. The reader should refer to [21] for aspects of the pairing model not explained
here.
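
The pairing model described above is straightforward to simulate. The sketch below is an illustration only: the names `random_pairing` and `is_balanced_proper` are hypothetical, and point $x$ is assumed to belong to cell `x // d`. Pairing consecutive entries of a uniformly shuffled list of points yields a uniformly random perfect matching.

```python
import random
from collections import Counter


def random_pairing(n, d, seed=None):
    """Sample a uniform perfect matching on d*n points and project it to cells.

    Returns a list of d*n/2 edges (u, v) between cells; loops and multiple
    edges are possible, so the result is a d-regular multigraph.
    """
    if (d * n) % 2:
        raise ValueError("dn must be even")
    rng = random.Random(seed)
    points = list(range(d * n))
    rng.shuffle(points)
    # Consecutive points of the shuffled list are matched to each other.
    return [(points[i] // d, points[i + 1] // d) for i in range(0, d * n, 2)]


def is_balanced_proper(edges, colouring, k):
    """Check that a colouring (list of colours 0..k-1, one per cell) is balanced and proper."""
    n = len(colouring)
    sizes = Counter(colouring)
    balanced = n % k == 0 and all(sizes[c] == n // k for c in range(k))
    proper = all(colouring[u] != colouring[v] for u, v in edges)
    return balanced and proper
```

A loop contributes 2 to the degree of its cell, so every cell has degree exactly $d$ in the resulting multigraph.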

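Proposition 2 Fix integers $d, k \ge 3$. Let $Y$ be the number of balanced $k$-colourings of a
random $d$-regular multigraph $\mathcal{P}_{n,d}$ (where $n$ is restricted to the set of multiples of $k$).

(a) For $m \ge 1$, let $X_m$ be the number of $m$-cycles in $\mathcal{P}_{n,d}$. Then

\[ E(Y) \sim k^{k/2}\left(\frac{k-1}{2\pi(k-2)}\right)^{(k-1)/2} n^{-(k-1)/2}\, k^{n}\left(1-\frac{1}{k}\right)^{dn/2} \]

and, for any fixed integers $j_1, \dots, j_r \ge 0$,

\[ \frac{E\big(Y [X_1]_{j_1}\cdots[X_r]_{j_r}\big)}{E(Y)} \sim \prod_{m=1}^{r}\big(\lambda_m(1+\delta_m)\big)^{j_m}, \tag{1.1} \]

where $[X]_j$ denotes the falling factorial and

\[ \lambda_m = \frac{(d-1)^m}{2m}, \qquad \delta_m = \frac{(-1)^m}{(k-1)^{m-1}}. \]

(b) If furthermore $d < 2(k-1)\log(k-1)$ then

\[ E(Y^2) \sim \frac{k^{k}(k-1)^{k(k-1)}}{(k^2-2k-d+2)^{(k-1)^2/2}\,(2\pi(k-2))^{k-1}}\; n^{-(k-1)}\, k^{2n}\left(1-\frac{1}{k}\right)^{dn}. \]

We next compute

\[ \sum_{m\ge 1}\lambda_m\delta_m^2 = (k-1)^2\log\frac{k-1}{\sqrt{k^2-2k-d+2}}, \tag{1.2} \]

and verify that for $n$ divisible by $k$

\[ \frac{E(Y^2)}{(EY)^2} \sim \exp\Big(\sum_{m\ge1}\lambda_m\delta_m^2\Big), \tag{1.3} \]

which is the last ingredient required for the application of the small subgraph conditioning
method.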
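
The identity (1.2) can be checked numerically. The sketch below assumes the readings $\lambda_m = (d-1)^m/(2m)$ and $\delta_m = (-1)^m/(k-1)^{m-1}$, so that $\lambda_m\delta_m^2 = (k-1)^2 x^m/(2m)$ with $x = (d-1)/(k-1)^2$; the function names are hypothetical. The series converges because $k^2-2k-d+2 > 0$ is equivalent to $x < 1$.

```python
import math


def series_sum(d, k, terms=400):
    """Partial sum of lambda_m * delta_m**2 over m = 1..terms.

    Uses lambda_m * delta_m**2 = (k-1)**2 * x**m / (2m), x = (d-1)/(k-1)**2,
    which avoids overflow from computing (d-1)**m directly.
    """
    x = (d - 1) / (k - 1) ** 2
    if not 0 <= x < 1:
        raise ValueError("series diverges unless d - 1 < (k-1)^2")
    return sum((k - 1) ** 2 * x ** m / (2 * m) for m in range(1, terms + 1))


def closed_form(d, k):
    """Right-hand side of (1.2): (k-1)^2 * log((k-1)/sqrt(k^2-2k-d+2))."""
    return (k - 1) ** 2 * math.log((k - 1) / math.sqrt(k * k - 2 * k - d + 2))


if __name__ == "__main__":
    print(series_sum(10, 5), closed_form(10, 5))
```

Exponentiating the closed form gives $\big((k-1)^2/(k^2-2k-d+2)\big)^{(k-1)^2/2}$, which is the ratio of the two asymptotic expressions in Proposition 2.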

Proof of Theorem 1 (for $n$ divisible by $k$). Assume throughout the proof that $k$ divides
$n$, and observe that all the conditions of Theorem 4.1 in [21] are verified by Proposition 2,
(1.2) and (1.3). Thus we may apply the small subgraph conditioning method to conclude
that $P(Y > 0 \mid \mathcal{E}) \to 1$, where $\mathcal{E} = \bigwedge_{\delta_m = -1}\{X_m = 0\} = \{X_1 = 0\}$ is the event of having
no loops. Because $P(X_2 = 0)$ is bounded away from 0 for large $n$ (see e.g. [21]), it follows
that $Y > 0$ a.a.s. for the simple graphs in $\mathcal{G}_{n,d}$, thus proving the required upper bound on the
chromatic number. Now observe that from our choice of $k$ we have $d \ge 2(k-2)\log(k-2) >
(2k-5)\log(k-2)$. Then the required lower bounds follow immediately from the fact that
if $d > (2q-1)\log q$ then $\chi(\mathcal{G}_{n,d}) > q$ (applied to $q = k-2$ or $q = k-1$ for each case in
the statement). This is just a slightly weaker formulation of the result given by Molloy and
Reed [16]. (Their proof is reported in [20], Theorem 1.3.) ∎

The following two sections supply the proof of Proposition 2. Finally, in Section 4, we
conclude the proof of Theorem 1 by extending the argument to general $n$.




2 Joint moments: proof of Proposition 2(a) 

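Let $Y$ be the number of balanced $k$-colourings of a random $d$-regular multigraph $\mathcal{P}_{n,d}$. For
$m \ge 1$, let $X_m$ be the number of $m$-cycles in $\mathcal{P}_{n,d}$. We estimate the expected value of $Y$
by enumerating all balanced $k$-colourings of all multigraphs in $\mathcal{P}_{n,d}$. There are $\binom{n}{n/k,\,n/k,\,\dots,\,n/k}$
ways to choose the $k$ colour classes. These choices are all equivalent, so fix one. Suppose
there are $b_{ij} = b_{ji}$ edges between colour class $i$ and colour class $j$ (for $1 \le i,j \le k$ and
$i \ne j$). The colours of the neighbours of all of the points of colour class $i$ can then be
chosen in $(dn/k)!/\prod_{j \ne i} b_{ij}!$ ways. After this determination is made, edges are constructed
by putting a perfect matching between the corresponding points in each pair of classes, in
one of $\prod_{1\le i<j\le k} b_{ij}!$ ways. Thus we have

\begin{align*}
E(Y) &= \frac{1}{|\mathcal{P}_{n,d}|}\binom{n}{n/k,\,n/k,\,\dots,\,n/k}\sum_{(b_{ij})}\ \prod_{i=1}^{k}\frac{(dn/k)!}{\prod_{j\ne i} b_{ij}!}\ \prod_{1\le i<j\le k} b_{ij}! \\
&= \frac{1}{|\mathcal{P}_{n,d}|}\binom{n}{n/k,\,n/k,\,\dots,\,n/k}\,(dn/k)!^{k}\sum_{(b_{ij})}\ \prod_{1\le i<j\le k}\frac{1}{b_{ij}!} \\
&= \frac{1}{|\mathcal{P}_{n,d}|}\binom{n}{n/k,\,n/k,\,\dots,\,n/k}\,(dn/k)!^{k}\ \Big[\prod_{j=1}^{k} x_j^{dn/k}\Big]\prod_{1\le i<j\le k}\ \sum_{l\ge 0}\frac{(x_ix_j)^l}{l!} \\
&= \frac{1}{|\mathcal{P}_{n,d}|}\binom{n}{n/k,\,n/k,\,\dots,\,n/k}\,(dn/k)!^{k}\ \Big[\prod_{j=1}^{k} x_j^{dn/k}\Big]\exp\Big(\sum_{1\le i<j\le k} x_ix_j\Big),
\end{align*}

where the sums run over all symmetric arrays of non-negative integers $(b_{ij})$ with
$\sum_{j\ne i} b_{ij} = dn/k$ for each $i$, and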

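where square brackets denote the extraction of a coefficient from a generating function. A
particular case of the following result gives us an accurate estimate of that coefficient. The
proof is based on the saddlepoint method and is included later in this section.

Lemma 3 Let $k, d, a_1, a_2, \dots, a_k$ be fixed integers with $k \ge 3$, $d$ positive, and $s = \sum_{j=1}^{k} a_j$
even. Let $C_{a_1,a_2,\dots,a_k}$ denote the coefficient of $x_1^{dn/k+a_1}x_2^{dn/k+a_2}\cdots x_k^{dn/k+a_k}$ in the generating
function $\exp\big(\sum_{1\le j<l\le k} x_jx_l\big)$. Then as $n \to \infty$ we have $C_{a_1,a_2,\dots,a_k} \sim C(s)$, where

\[ C(s) = \frac{2\,e^{dn/2}}{(2\pi)^{k/2}(2k-2)^{1/2}(k-2)^{(k-1)/2}}\left(\frac{k(k-1)}{dn}\right)^{(dn+s+k)/2}. \]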

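Hence we deduce

\[ E(Y) \sim \frac{1}{|\mathcal{P}_{n,d}|}\binom{n}{n/k,\,n/k,\,\dots,\,n/k}\,(dn/k)!^{k}\,C(0). \tag{2.1} \]

Combining this with the well-known formula for the number of pairings on $dn$ points,

\[ |\mathcal{P}_{n,d}| = \frac{(dn)!}{(dn/2)!\,2^{dn/2}} \sim \sqrt{2}\left(\frac{dn}{e}\right)^{dn/2}, \tag{2.2} \]

and after some basic manipulations using Stirling's formula we obtain the estimate for $E(Y)$
stated in the proposition.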

Next we estimate the expected value of $YX_m$, where $Y$ is the number of balanced
$k$-colourings and $X_m$ the number of length-$m$ cycles. It is more convenient to count rooted
oriented cycles, which introduces a factor of $2m$ into our calculations. It will be helpful to
have the following definitions. For a rooted oriented cycle in a coloured graph, define its colour
type to be the sequence $T$ of colours on its vertices. For $j = 1, 2, \dots, k$, let $a_j(T)$ denote the
number of vertices in $T$ which have colour $j$. Note that the sum $\sum_{j=1}^{k} a_j(T)$ is $m$.

To calculate the expected value of $YX_m$, we will count, for each balanced $k$-colouring and
each rooted oriented $m$-cycle, the number of pairings which contain this cycle and respect
this colouring.






As before, there are $\binom{n}{n/k,\,n/k,\,\dots,\,n/k}$ ways to choose the balanced $k$-colouring. All are
equivalent, so fix one. To enumerate the cycles and pairings which respect this colouring, we will
sum over all colour types $T$. Once a colour type has been chosen, each vertex of the cycle
can be placed in the pairing model by choosing a vertex of the correct colour and an ordered
pair of points in that vertex to be used by the cycle. Hence, in total, there are asymptotically
$(d(d-1)n/k)^m$ ways to place the rooted oriented cycle in the pairing model. We now have

\[ E(YX_m) \sim \frac{1}{2m}\binom{n}{n/k,\,n/k,\,\dots,\,n/k}\frac{1}{|\mathcal{P}_{n,d}|}\left(\frac{d(d-1)n}{k}\right)^{m}\sum_{T} f(T), \]

where $f(T)$ is the number of pairings which respect a fixed balanced $k$-colouring and fixed
rooted oriented cycle of colour type $T$. To count these pairings, suppose there are $b_{ij} = b_{ji}$
edges between colour class $i$ and colour class $j$ (for $1 \le i,j \le k$ and $i \ne j$), excluding the edges
of the prescribed cycle. The colours of the neighbours of all of the unmatched points of colour
class $i$ can then be chosen in $(dn/k - 2a_i(T))!/\prod_{j\ne i} b_{ij}!$ ways. After this determination is
made, edges are constructed by putting a perfect matching between the corresponding points
in each pair of classes, in one of $\prod_{1\le i<j\le k} b_{ij}!$ ways. Thus we have

\begin{align*}
f(T) &= \sum_{(b_{ij})}\ \prod_{i=1}^{k}\frac{(dn/k-2a_i(T))!}{\prod_{j\ne i} b_{ij}!}\ \prod_{1\le i<j\le k} b_{ij}! \\
&\sim \frac{(dn/k)!^{k}}{(dn/k)^{2m}}\ \sum_{(b_{ij})}\ \prod_{1\le i<j\le k}\frac{1}{b_{ij}!} \\
&= \frac{(dn/k)!^{k}}{(dn/k)^{2m}}\ \Big[\prod_{i=1}^{k} x_i^{dn/k-2a_i(T)}\Big]\prod_{1\le i<j\le k}\ \sum_{l\ge 0}\frac{(x_ix_j)^l}{l!} \\
&= \frac{(dn/k)!^{k}}{(dn/k)^{2m}}\ \Big[\prod_{i=1}^{k} x_i^{dn/k-2a_i(T)}\Big]\exp\Big(\sum_{1\le i<j\le k} x_ix_j\Big).
\end{align*}

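By Lemma 3 the asymptotic value of the coefficient in the last expression is $C(-2m)$, making
the entire expression independent of $T$. Moreover, the number $t_m$ of possible colour types for
a rooted oriented cycle of length $m$ satisfies the obvious recurrence $t_m + t_{m-1} = k(k-1)^{m-1}$
with $t_1 = 0$. So we have $t_m = (k-1)^m + (k-1)(-1)^m$ and therefore

\[ E(YX_m) \sim \frac{1}{2m}\binom{n}{n/k,\,n/k,\,\dots,\,n/k}\left(\frac{d(d-1)n}{k}\right)^{m}\frac{1}{|\mathcal{P}_{n,d}|}\,\frac{(dn/k)!^{k}}{(dn/k)^{2m}}\,\big((k-1)^m + (k-1)(-1)^m\big)\,C(-2m). \]

Comparing this expression with (2.1) we see that

\[ \frac{E(YX_m)}{E(Y)} \sim \big((k-1)^m + (k-1)(-1)^m\big)\,\frac{1}{2m}\left(\frac{k(d-1)}{dn}\right)^{m}\frac{C(-2m)}{C(0)}. \]

Since $C(s)$ depends on $s$ only through a factor $(k(k-1)/(dn))^{s/2}$, we have
$C(-2m)/C(0) = (dn/(k(k-1)))^{m}$, and the right-hand side equals $\lambda_m(1+\delta_m)$.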

The above argument is easily extended to work for higher moments, by counting the pairings
that contain a given balanced $k$-colouring and set of oriented cycles of the appropriate lengths.
The contribution from cases where the cycles intersect turns out to be negligible, for the
following reasons. Suppose that the cycles form a subgraph $H$ with $\nu$ vertices and $\mu$ edges,
and the total length of cycles is $\nu_0$. Then in the case of disjoint cycles, $\nu = \mu = \nu_0$. A
factor of $\Theta(n^{\nu_0-\nu})$ is lost if there is a reduction in the number of vertices of $H$, compared
with the disjoint case, because of the reduced number of ways of placing the cycles on the
coloured vertices. Similarly, a factor $\Theta(n^{\nu_0-\mu})$ is gained in the function $f$ for the reduction in
the number of edges of $H$, because of the corresponding increase in the number of points to
be paired up at the end. Thus, the contribution from such an arrangement of cycles to the
quantity being estimated is of the order of $n^{\nu-\mu}$ times that of the contribution from disjoint
cycles. In all non-disjoint cases, $H$ has more edges than vertices, since its minimum degree
is at least 2, and it has at least one vertex of degree at least 3. There are only finitely many
isomorphism types of $H$ to consider, so the contribution from the case of disjoint cycles is of
the order of $n$ times the rest. The significant terms in this case decompose into a product of
the factors corresponding to the individual cycles, and we obtain

\[ \frac{E\big(Y[X_1]_{j_1}\cdots[X_r]_{j_r}\big)}{E(Y)} \sim \prod_{m=1}^{r}\big(\lambda_m(1+\delta_m)\big)^{j_m} \]

as claimed. ∎
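
The count of colour types used in this proof can be confirmed by brute force for small cases. The sketch below is an illustration, with hypothetical function names: a colour type of a rooted oriented $m$-cycle is taken to be a sequence of $m$ colours from $\{0,\dots,k-1\}$ in which cyclically consecutive entries differ, which for $m = 1$ (a loop) leaves no valid type.

```python
from itertools import product


def count_colour_types(k, m):
    """Count sequences of m colours in which cyclically adjacent colours differ."""
    return sum(
        1
        for t in product(range(k), repeat=m)
        if all(t[i] != t[(i + 1) % m] for i in range(m))
    )


def colour_types_closed_form(k, m):
    """Closed form t_m = (k-1)^m + (k-1)(-1)^m quoted in the proof."""
    return (k - 1) ** m + (k - 1) * (-1) ** m


if __name__ == "__main__":
    for k in (3, 4):
        for m in range(1, 7):
            print(k, m, count_colour_types(k, m), colour_types_closed_form(k, m))
```

The same enumeration also confirms the recurrence $t_m + t_{m-1} = k(k-1)^{m-1}$ for the tested values.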

It only remains to prove Lemma 3. Before doing so, we need the following result, which
will be used several times in the paper.

Lemma 4 Let $k$ be a positive integer. Define the function $f : \mathbb{R}^k \to \mathbb{C}$ by

\[ f(\theta) = i\,a(n,\theta) - c_1 n\,\theta^T B\,\theta, \]

where $i$ is the imaginary unit, $a$ is a real function, $B$ is a fixed $k$-by-$k$ positive definite real
matrix, and $c_1 > 0$ is a real constant. Let $\delta = c_2 n^{-1/2}\log n$ for some real constant $c_2 > 0$.
Then, as $n \to \infty$,

\[ \int_{[-\delta,\delta]^k} e^{f(\theta)}\,d\theta = \int_{[-\infty,\infty]^k} e^{f(\theta)}\,d\theta + O\big(e^{-c\log^2 n}\big) \]

for some constant $c > 0$.

Proof. Note that in order to bound the integral of $e^{f(\theta)}$ over $[-\infty,\infty]^k \setminus [-\delta,\delta]^k$ we only need
to consider the real part of $f(\theta)$. Moreover, since $B$ is positive definite we have $\theta^T B\theta \ge \lambda|\theta|^2$,
where $\lambda > 0$ is the smallest eigenvalue of $B$. The proof is elementary in view of these two
observations. ∎

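Proof of Lemma 3. We will use the saddlepoint method. First we use Cauchy's formula
to express $C_{a_1,a_2,\dots,a_k}$ as an integral over the product of circles $z_j = re^{i\theta_j}$, $-\pi < \theta_j \le \pi$ for
$j = 1, 2, \dots, k$, where $r = \sqrt{dn/(k(k-1))}$:

\begin{align*}
C_{a_1,a_2,\dots,a_k} &= \frac{1}{(2\pi i)^k}\oint\cdots\oint \frac{\exp\big(\sum_{j<l} z_jz_l\big)}{z_1^{dn/k+a_1+1}z_2^{dn/k+a_2+1}\cdots z_k^{dn/k+a_k+1}}\,dz_1\,dz_2\cdots dz_k \\
&= \frac{1}{(2\pi)^k}\int_{-\pi}^{\pi}\cdots\int_{-\pi}^{\pi} \frac{\exp\big(r^2\sum_{j<l} e^{i(\theta_j+\theta_l)}\big)}{(re^{i\theta_1})^{dn/k+a_1}(re^{i\theta_2})^{dn/k+a_2}\cdots(re^{i\theta_k})^{dn/k+a_k}}\,d\theta_1\,d\theta_2\cdots d\theta_k \\
&= \frac{1}{(2\pi)^k r^{dn+s}}\int_{-\pi}^{\pi}\cdots\int_{-\pi}^{\pi} \frac{\exp\big(r^2\sum_{j<l} e^{i(\theta_j+\theta_l)}\big)}{\exp\big(i\sum_j (dn/k+a_j)\theta_j\big)}\,d\theta_1\,d\theta_2\cdots d\theta_k.
\end{align*}

Let $g(\theta)$ denote the integrand in the last expression above. Letting $\mathbf{1}$ denote the vector of
1's, consider the image of $g(\theta)$ under the transformation $\theta \mapsto \theta + \pi\mathbf{1}$. It is clear that the
numerator is fixed by this transformation. The denominator becomes

\begin{align*}
\exp\Big(i\sum_j (dn/k+a_j)(\theta_j+\pi)\Big) &= \exp\Big(i\sum_j (dn/k+a_j)\theta_j\Big)\exp\big(i(dn+s)\pi\big) \\
&= \exp\Big(i\sum_j (dn/k+a_j)\theta_j\Big)
\end{align*}

since $dn$ (the sum of the vertex degrees) and $s$ are both even. So $g(\theta)$ is fixed by this
transformation. Letting $\delta = \log n/\sqrt{n}$, this means that the integrals of $g(\theta)$ over the regions
$\{\theta : |\theta_j| \le \delta,\ j = 1,2,\dots,k\}$ and $\{\theta : \pi-\delta \le |\theta_j| \le \pi,\ j = 1,2,\dots,k\}$ are equal. We will
prove that the integral of $g(\theta)$ over each of these regions is asymptotically equal to

\[ I = e^{dn/2}(2\pi)^{k/2}\left(\frac{k(k-1)}{dn}\right)^{k/2}(2k-2)^{-1/2}(k-2)^{-(k-1)/2} = \kappa\exp\Big(\frac{dn}{2} - \frac{k}{2}\log n\Big), \]

where $\kappa$ is a constant, and we will show that the integral over the remaining regions is
asymptotically smaller. From these results the lemma follows.

To prove that the integral over vectors $\theta$ in the remaining regions is asymptotically smaller,
there are two cases: either $|\theta_{j^*}| \le \delta$ and $\pi-\delta \le |\theta_{l^*}| \le \pi$ for some distinct $j^*$ and $l^*$, or
$\delta < |\theta_{j^*}| < \pi-\delta$ for some $j^*$.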






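In the first case, suppose that $|\theta_{j^*}| \le \delta$ and $\pi-\delta \le |\theta_{l^*}| \le \pi$ for some distinct $j^*$ and $l^*$.
Then $\pi-2\delta \le |\theta_{j^*}+\theta_{l^*}| \le \pi+2\delta$ and hence $\cos(\theta_{j^*}+\theta_{l^*}) < 0$. So

\begin{align*}
|g(\theta)| = \Big|\exp\Big(r^2\sum_{j<l} e^{i(\theta_j+\theta_l)}\Big)\Big|
&\le \exp\Big(r^2\Big(\binom{k}{2}-1\Big) + r^2\cos(\theta_{j^*}+\theta_{l^*})\Big) \\
&\le \exp\Big(\frac{dn}{2} - \frac{dn}{k(k-1)}\Big) \\
&= o(I).
\end{align*}

In the second case we suppose that $\delta < |\theta_{j^*}| < \pi-\delta$ for some $j^*$. If there is a value of $l^*$
for which $|\theta_{j^*}+\theta_{l^*}| \ge \delta/2$ then $\delta/2 \le |\theta_{j^*}+\theta_{l^*}| \le 2\pi-\delta/2$. This means

\[ \cos(\theta_{j^*}+\theta_{l^*}) \le \cos(\delta/2) = 1 - \frac{\delta^2}{8} + O(\delta^4) \]

and hence

\begin{align*}
|g(\theta)| = \exp\Big(r^2\sum_{j<l}\cos(\theta_j+\theta_l)\Big)
&\le \exp\Big(r^2\Big(\binom{k}{2}-1\Big) + r^2\cos(\theta_{j^*}+\theta_{l^*})\Big) \\
&\le \exp\Big(r^2\Big(\binom{k}{2}-1\Big) + r^2\Big(1-\frac{\delta^2}{8}+O(\delta^4)\Big)\Big) \\
&= \exp\Big(r^2\binom{k}{2} - \frac{r^2\delta^2}{8} + O(r^2\delta^4)\Big) \\
&= \exp\Big(\frac{dn}{2} - \frac{d\log^2 n}{8k(k-1)} + o(1)\Big) \\
&= o(I).
\end{align*}

Otherwise, there is no such $l^*$. That is, for all $l^*$ not equal to $j^*$ we have $|\theta_{l^*} - (-\theta_{j^*})| < \delta/2$.
This implies that all $\theta_l$ with $l \ne j^*$ have the same sign and satisfy $\delta/2 < |\theta_l| < \pi-\delta/2$. Since
$k \ge 3$ we can choose two distinct such indices, say $l^*$ and $l^{**}$, and deduce

\[ \delta < |\theta_{l^*}+\theta_{l^{**}}| < 2\pi-\delta. \]

Using the same argument as above, it follows that $|g(\theta)| = o(I)$.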

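This completes the proof that the integral of $g(\theta)$ over these regions is asymptotically
negligible, as claimed.

It remains to show the integral of $g(\theta)$ over the region $\{\theta : |\theta_j| \le \delta,\ j = 1,2,\dots,k\}$ is
asymptotically equal to $I$. We begin by expanding

\begin{align*}
\log g(\theta) &= \binom{k}{2}r^2 + i\big(r^2(k-1) - dn/k + O(1)\big)\sum_{j=1}^{k}\theta_j - \frac{r^2}{2}\sum_{j<l}(\theta_j+\theta_l)^2 + O\Big(r^2\sum_{j=1}^{k}|\theta_j|^3\Big) \\
&= \frac{dn}{2} - \frac{r^2}{2}\sum_{j<l}(\theta_j+\theta_l)^2 + o(1),
\end{align*}

since $r^2 = dn/(k(k-1))$ and $|\theta_j| \le \delta = \log n/\sqrt{n}$ for all $j$. The quadratic order term can
be written as $-\frac{r^2}{2}\sum_{j<l}(\theta_j+\theta_l)^2 = -\frac12\theta^TA\theta$. Here, $\theta^T$ denotes the transpose of the column
vector $\theta$ and $A$ is the matrix $A = r^2(\mathbf{1}\mathbf{1}^T + (k-2)I_k)$, where $I_k$ is the $k$-by-$k$ identity matrix.
By Lemma 4 and since $A$ is positive definite, we have for some constant $c > 0$

\[ \int_{-\delta}^{\delta}\cdots\int_{-\delta}^{\delta}\exp\Big(-\frac12\theta^TA\theta\Big)d\theta_1\cdots d\theta_k = \int_{-\infty}^{\infty}\cdots\int_{-\infty}^{\infty}\exp\Big(-\frac12\theta^TA\theta\Big)d\theta_1\,d\theta_2\cdots d\theta_k + O\big(e^{-c\log^2 n}\big). \]

It is well-known (see Equation 4.6.3 in [10]) that such integrals have the value $(2\pi)^{k/2}(\det A)^{-1/2}$,
giving us

\[ \int_{-\infty}^{\infty}\cdots\int_{-\infty}^{\infty}\exp\Big(-\frac{r^2}{2}\sum_{j<l}(\theta_j+\theta_l)^2\Big)d\theta_1\,d\theta_2\cdots d\theta_k = (2\pi)^{k/2}\big(r^{2k}(2k-2)(k-2)^{k-1}\big)^{-1/2}. \]

We conclude

\begin{align*}
\int_{-\delta}^{\delta}\cdots\int_{-\delta}^{\delta} g(\theta)\,d\theta_1\,d\theta_2\cdots d\theta_k
&\sim e^{dn/2}(2\pi)^{k/2}\big(r^{2k}(2k-2)(k-2)^{k-1}\big)^{-1/2} \\
&= e^{dn/2}(2\pi)^{k/2}\left(\frac{k(k-1)}{dn}\right)^{k/2}(2k-2)^{-1/2}(k-2)^{-(k-1)/2} \\
&= I
\end{align*}

as claimed.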
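
For $k = 3$ the coefficient in Lemma 3 can be computed exactly, which gives a sanity check on the saddlepoint estimate. The sketch below is not from the paper and rests on two stated assumptions: first, that the proof above yields $C(s) = 2I/((2\pi)^k r^{dn+s})$, i.e. $C(s) = 2e^{dn/2}(2\pi)^{-k/2}(2k-2)^{-1/2}(k-2)^{-(k-1)/2}(k(k-1)/(dn))^{(dn+s+k)/2}$; second, that for $k = 3$ and $s = 0$ the constraints $b_{12}+b_{13} = b_{12}+b_{23} = b_{13}+b_{23} = dn/3$ force $b_{12}=b_{13}=b_{23}=dn/6$, so the exact coefficient is $1/((dn/6)!)^3$. Function names are hypothetical.

```python
import math


def log_exact_coeff_k3(d, n):
    """log of the coefficient of (x1*x2*x3)^(dn/3) in exp(x1*x2 + x1*x3 + x2*x3).

    Requires dn/3 to be an even integer; the coefficient is 1 / ((dn/6)!)^3.
    """
    if (d * n) % 6:
        raise ValueError("dn must be divisible by 6")
    return -3 * math.lgamma(d * n // 6 + 1)


def log_saddlepoint_estimate(k, d, n, s=0):
    """log of the assumed asymptotic value C(s) derived from the proof of Lemma 3."""
    return (
        math.log(2)
        - (k / 2) * math.log(2 * math.pi)
        - 0.5 * math.log(2 * k - 2)
        - ((k - 1) / 2) * math.log(k - 2)
        + d * n / 2
        + ((d * n + s + k) / 2) * math.log(k * (k - 1) / (d * n))
    )


if __name__ == "__main__":
    for n in (30, 300, 3000):
        diff = log_exact_coeff_k3(4, n) - log_saddlepoint_estimate(3, 4, n)
        print(n, diff)  # difference of logarithms, should shrink as n grows
```

The difference of logarithms is of order $1/(dn)$, consistent with the relative error in Stirling's formula.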



3 Second moment: proof of Proposition 2(b)

Throughout this section, fix positive integers $d$ and $k \ge 3$ satisfying $d < 2(k-1)\log(k-1)$.
Assume 2 divides $dn$ and $k$ divides $n$. Let $C_1$ and $C_2$ be balanced $k$-colourings of a pairing
$P \in \mathcal{P}_{n,d}$. The colour count of $(C_1, C_2)$ is the $k$-by-$k$ matrix $M = [m_{p,q}]$ where $m_{p,q}n/k$ is the
number of cells coloured $p$ in $C_1$ and coloured $q$ in $C_2$. Let $\mathcal{M}$ be the set of $k$-by-$k$ doubly
stochastic matrices (i.e. nonnegative matrices with each row sum and column sum equal to
1). Since the colourings are balanced, we must have $M \in \mathcal{M}$. Define $T(M)$ to be the set of
triples $(P, C_1, C_2)$ where $P \in \mathcal{P}_{n,d}$ and $(C_1, C_2)$ is a pair of balanced $k$-colourings of $P$ having
colour count $M$. Then,

\[ E(Y^2) = \sum_{M \in \mathcal{M}\,\cap\,(k/n)\mathbb{Z}^{k^2}} \frac{|T(M)|}{|\mathcal{P}_{n,d}|}. \tag{3.1} \]
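
To make the definition of the colour count concrete, here is a small sketch (the function names are hypothetical, and colours are numbered from 0 for convenience). It builds $M$ from two colourings given as lists of colours, one entry per cell, using exact rational arithmetic, and checks the doubly stochastic property.

```python
from fractions import Fraction


def colour_count(c1, c2, k):
    """Return the k-by-k colour count M with M[p][q] * n / k = #cells coloured (p, q)."""
    n = len(c1)
    counts = [[0] * k for _ in range(k)]
    for p, q in zip(c1, c2):
        counts[p][q] += 1
    return [[Fraction(counts[p][q] * k, n) for q in range(k)] for p in range(k)]


def is_doubly_stochastic(M):
    """Check nonnegativity and that every row sum and column sum equals 1."""
    k = len(M)
    rows_ok = all(sum(M[p]) == 1 for p in range(k))
    cols_ok = all(sum(M[p][q] for p in range(k)) == 1 for q in range(k))
    nonneg = all(x >= 0 for row in M for x in row)
    return rows_ok and cols_ok and nonneg


if __name__ == "__main__":
    M = colour_count([0, 0, 1, 1, 2, 2], [0, 1, 1, 2, 2, 0], 3)
    print(M, is_doubly_stochastic(M))
```

Balancedness of both colourings is exactly what makes the row and column sums equal to 1; an unbalanced colouring fails the check.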



In order to estimate the sum in (3.1), we first obtain an exact expression for $|T(M)|$, where
$M = [m_{p,q}]$ is any $k$-by-$k$ doubly stochastic matrix whose entries are integer multiples of $k/n$.

For all $1 \le p,q \le k$ we must choose $m_{p,q}n/k$ cells to be assigned the colour $p$ in the first
colouring and $q$ in the second colouring. We say that such a cell and its points have label
$(p,q)$. The number of ways of doing this is given by the multinomial coefficient

\[ \frac{n!}{\prod_{1\le p,q\le k}(m_{p,q}n/k)!}. \]

Now we must select the edges of the pairing in a way which is compatible with the two
colourings. Suppose we know the number $b_{pqrs}$ of edges from points labelled $(p,q)$ to points
labelled $(r,s)$ for all $1 \le p,q,r,s \le k$ with $p \ne r$ and $q \ne s$. Then we choose, for each ordered
pair of labels $((p,q),(r,s))$, which $b_{pqrs}$ of the points labelled $(p,q)$ will be paired with points
labelled $(r,s)$. The number of ways of doing this is

\[ \prod_{1\le p,q\le k}\frac{(dm_{p,q}n/k)!}{\prod_{r\ne p,\ s\ne q} b_{pqrs}!}. \]

Finally, for each unordered pair of labels $\{(p,q),(r,s)\}$, we choose a bijection between the
points labelled $(p,q)$ and the points labelled $(r,s)$ that were designated to be paired with
each other. The number of ways of doing this is

\[ \prod_{\substack{1\le p,q,r,s\le k\\ p<r,\ q\ne s}} b_{pqrs}!. \]

Observe that the only restrictions on $b_{pqrs}$ required in our counting are

\[ b_{pqrs} = b_{rspq} \qquad \forall p,q,r,s \in \{1,\dots,k\},\ r\ne p,\ s\ne q, \tag{3.2} \]

\[ \sum_{r\ne p,\ s\ne q} b_{pqrs} = dm_{p,q}n/k \qquad \forall p,q \in \{1,\dots,k\}. \tag{3.3} \]

Thus, the total number of triples $(P, C_1, C_2)$ in $T(M)$ is

\[ |T(M)| = n!\left(\prod_{1\le p,q\le k}\frac{(dm_{p,q}n/k)!}{(m_{p,q}n/k)!}\right)\sum_{B(M,n)}\ \prod_{\substack{1\le p,q,r,s\le k\\ p<r,\ q\ne s}}\frac{1}{b_{pqrs}!}, \tag{3.4} \]

where $B(M,n)$ is the set of tuples of non-negative integers $(b_{pqrs})_{1\le p,q,r,s\le k}$ satisfying (3.2)
and (3.3). Note that (3.4) is the expression used in [2].

As we shall see later, the main weight of the sum in (3.1) corresponds to terms in which $M$
is 'near' $(1/k)J_k$, where $J_k$ denotes the $k$-by-$k$ matrix of ones. To state this more precisely, for
$\delta > 0$ and any positive integer $p$, we define $B_p(\delta)$ to be the set of $p$-by-$p$ matrices $M = [m_{ij}]$
for which $|m_{ij} - 1/k| < \delta$ for all $i,j$. It will be shown that the essential contribution to (3.1)
comes from terms such that $M \in B_k(\epsilon\log n/n^{1/2})$, where $\epsilon > 0$ is a small constant that will
be specified later. Thus we now proceed to bound $|T(M)|/|\mathcal{P}_{n,d}|$ for each $M \in \mathcal{M}\cap(k/n)\mathbb{Z}^{k^2}$,
and then find more accurate asymptotic expressions for $M \in B_k(\epsilon\log n/n^{1/2})$.

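We begin by estimating the ratio of factorials in (3.4). Recall that one can write Stirling's
formula as $x! = \xi(x)(x/e)^x$ where $\xi$ is a function satisfying $\xi(x) \sim \sqrt{2\pi x}$ as $x \to \infty$ and
$\xi(x) \ge 1$ for all $x \ge 0$. Thus,

\begin{align*}
\prod_{1\le p,q\le k}\frac{(dm_{p,q}n/k)!}{(m_{p,q}n/k)!}
&= \prod_{1\le p,q\le k}\frac{\xi(dm_{p,q}n/k)\,(dm_{p,q}n/(ke))^{dm_{p,q}n/k}}{\xi(m_{p,q}n/k)\,(m_{p,q}n/(ke))^{m_{p,q}n/k}} \\
&= \left(\prod_{1\le p,q\le k}\frac{\xi(dm_{p,q}n/k)}{\xi(m_{p,q}n/k)}\right) d^{dn}\left(\frac{n}{ek}\right)^{(d-1)n}\left(\prod_{1\le p,q\le k} m_{p,q}^{m_{p,q}}\right)^{(d-1)n/k}, \tag{3.5}
\end{align*}

where in the final step we used $\sum_{1\le p,q\le k} m_{p,q} = k$, which holds because $M$ is doubly stochastic.
(Throughout the article we use the convention $0^0 = 1$ and $0\log 0 = 0$.) Moreover, since
$\xi(m_{p,q}n/k) \ge 1$ and $\xi(dm_{p,q}n/k) = O(n^{1/2})$ for each $m_{p,q}$, we obtain the following bound,
which does not depend on the particular $M$:

\[ \prod_{1\le p,q\le k}\frac{(dm_{p,q}n/k)!}{(m_{p,q}n/k)!} = O\big(n^{k^2/2}\big)\, d^{dn}\left(\frac{n}{ek}\right)^{(d-1)n}\left(\prod_{1\le p,q\le k} m_{p,q}^{m_{p,q}}\right)^{(d-1)n/k}. \tag{3.6} \]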

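Next we bound the inner sum in (3.4), by further applying Stirling's formula, and also using
the obvious crude bound $|B(M,n)| = O(n^{k^4})$ on the number of terms. Since the $b_{pqrs}$ with
$p<r$, $q\ne s$ sum to $dn/2$, we have

\begin{align*}
\sum_{B(M,n)}\ \prod_{\substack{1\le p,q,r,s\le k\\ p<r,\ q\ne s}}\frac{1}{b_{pqrs}!}
&= \sum_{B(M,n)}\ \prod_{\substack{1\le p,q,r,s\le k\\ p<r,\ q\ne s}}\frac{1}{\xi(b_{pqrs})\,(b_{pqrs}/e)^{b_{pqrs}}} \\
&\le \sum_{B(M,n)}\ \prod_{\substack{1\le p,q,r,s\le k\\ p<r,\ q\ne s}}\frac{1}{(b_{pqrs}/e)^{b_{pqrs}}} \\
&= e^{dn/2}\sum_{B(M,n)}\ \prod_{\substack{1\le p,q,r,s\le k\\ p<r,\ q\ne s}}\frac{1}{b_{pqrs}^{\,b_{pqrs}}}.
\end{align*}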

The following result allows us to derive a more explicit bound, conveniently expressed in terms
of $M$ and $n$. The proof follows the ideas in [2] and is given later in this section.

Lemma 5 Let $M = [m_{p,q}]$ be a fixed matrix in $\mathcal{M}$, and let $(b_{pqrs})_{1\le p,q,r,s\le k}$ be any tuple of
non-negative reals satisfying (3.2) and (3.3). Then,

\[ \prod_{\substack{1\le p,q,r,s\le k\\ p<r,\ q\ne s}}\frac{1}{b_{pqrs}^{\,b_{pqrs}}} \le \left(\frac{1}{\prod_{1\le p,q\le k} m_{p,q}^{m_{p,q}}}\right)^{dn/k}\left(\frac{\sum_{p\ne r,\ q\ne s} m_{p,q}m_{r,s}}{dn}\right)^{dn/2}. \]

Hence, we immediately deduce that

\[ \sum_{B(M,n)}\ \prod_{\substack{1\le p,q,r,s\le k\\ p<r,\ q\ne s}}\frac{1}{b_{pqrs}!} = O\big(n^{k^4}\big)\left(\frac{1}{\prod_{1\le p,q\le k} m_{p,q}^{m_{p,q}}}\right)^{dn/k}\left(\frac{\sum_{p\ne r,\ q\ne s} m_{p,q}m_{r,s}}{dn/e}\right)^{dn/2}. \tag{3.7} \]



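Finally, define for any $M \in \mathcal{M}$

\[ \varphi(M) = -\frac{1}{k}\sum_{1\le p,q\le k} m_{p,q}\log m_{p,q} + \frac{d}{2}\log\Big(1 - \frac{2}{k} + \frac{1}{k^2}\sum_{1\le p,q\le k} m_{p,q}^2\Big). \tag{3.8} \]

(Recall the convention $0^0 = 1$ and $0\log 0 = 0$.) By combining (3.6), (3.7), the Stirling-formula
estimate $n! \sim \sqrt{2\pi n}(n/e)^n$ and (2.2), we have

\begin{align*}
\frac{|T(M)|}{|\mathcal{P}_{n,d}|} &= \frac{\sqrt{2\pi n}\,(n/e)^n}{\sqrt{2}\,(dn/e)^{dn/2}} \\
&\quad\times O\left( n^{k^2/2+k^4}\, d^{dn}\left(\frac{n}{ek}\right)^{(d-1)n}\Big(\prod_{1\le p,q\le k} m_{p,q}^{m_{p,q}}\Big)^{(d-1)n/k}\Big(\prod_{1\le p,q\le k} m_{p,q}^{m_{p,q}}\Big)^{-dn/k}\left(\frac{\sum_{p\ne r,\ q\ne s} m_{p,q}m_{r,s}}{dn/e}\right)^{dn/2}\right) \\
&\le \mathrm{poly}(n)\,k^{n}\,e^{n\varphi(M)} \tag{3.9}
\end{align*}

for some polynomial $\mathrm{poly}(n)$ not depending on the particular $M$. Here we used the identity
$\sum_{p\ne r,\ q\ne s} m_{p,q}m_{r,s} = k^2 - 2k + \sum_{p,q} m_{p,q}^2$, valid for doubly stochastic $M$. This is a sufficient bound
on $|T(M)|/|\mathcal{P}_{n,d}|$, since it will allow us to show that the sum in (3.1) receives a negligible
contribution due to terms with $M$ away from $(1/k)J_k$.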
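
The exponent function from (3.8) is simple to evaluate, which helps build intuition for why the uniform matrix $(1/k)J_k$ dominates. The sketch below assumes the reading $\varphi(M) = -\frac1k\sum m_{p,q}\log m_{p,q} + \frac d2\log\big(1-\frac2k+\frac1{k^2}\sum m_{p,q}^2\big)$ with the convention $0\log 0 = 0$; the function name `phi` is hypothetical.

```python
import math


def phi(M, d):
    """Evaluate the assumed form of (3.8) for a k-by-k doubly stochastic matrix M."""
    k = len(M)
    # Entropy-like term, skipping zero entries (convention 0 log 0 = 0).
    entropy = -sum(x * math.log(x) for row in M for x in row if x > 0) / k
    sum_sq = sum(x * x for row in M for x in row)
    return entropy + (d / 2) * math.log(1 - 2 / k + sum_sq / k ** 2)


if __name__ == "__main__":
    k, d = 5, 10
    uniform = [[1 / k] * k for _ in range(k)]
    identity = [[1.0 if p == q else 0.0 for q in range(k)] for p in range(k)]
    print(phi(uniform, d), math.log(k) + d * math.log(1 - 1 / k))
    print(phi(identity, d))
```

At $M = (1/k)J_k$ the value is $\log k + d\log(1-1/k)$, so $k^n e^{n\varphi(M)}$ matches the exponential order $k^{2n}(1-1/k)^{dn}$ of $E(Y^2)$ in Proposition 2(b).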

It remains to find a suitable asymptotic expression of $|T(M)|/|\mathcal{P}_{n,d}|$ for $M \in B_k(\epsilon\log n/n^{1/2})$,
by improving our previous estimates in (3.6) and (3.7). Before that, we state two technical
algebraic results which will be needed in the asymptotic calculations. Hereinafter, $\mathbf{1}$ represents
the $k$-dimensional vector of ones, while $\mathbf{1}^{(i)}$ represents the $k$-dimensional vector with entry
1 at position $i$ and 0 elsewhere; $I_k$ denotes the $k$-by-$k$ identity matrix; $\operatorname{vec}A$ is the vector
formed by stacking the columns of a matrix $A$ to form a single column vector; and $A^{\otimes 2}$ is
simply $A \otimes A$, with the standard notation $\otimes$ for the Kronecker product of matrices.

Lemma 6 Consider the vectors

Define $f^{(p,q)} = f^{(p)} \otimes f^{(q)}$ for $1 \le p,q \le k$.

(a) An orthonormal basis of eigenvectors for the matrix $(J_k - I_k)^{\otimes 2} + (k-1)^2I_{k^2}$ is given by
$\{f^{(p,q)}\}_{p,q=1}^{k}$, with corresponding eigenvalues

\begin{align*}
\lambda_{p,q} &= k^2-2k+2, & 1 \le p,q \le k-1, \\
\lambda_{p,k} &= (k-1)(k-2), & 1 \le p \le k-1, \\
\lambda_{k,q} &= (k-1)(k-2), & 1 \le q \le k-1, \\
\lambda_{k,k} &= 2(k-1)^2.
\end{align*}

The smallest of these eigenvalues is $(k-1)(k-2)$.

(b) Similarly, the eigenvectors of $(J_k + I_k)^{\otimes 2}$ are also $\{f^{(p,q)}\}_{p,q=1}^{k}$, and the corresponding
eigenvalues are 1 with multiplicity $(k-1)^2$, $k+1$ with multiplicity $2(k-1)$, and $(k+1)^2$
with multiplicity 1.

Proof. Immediate by checking that the eigenvectors satisfy the required properties. ∎

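Lemma 7 Let $A = [a_{ij}]$ be a $k$-by-$k$ matrix whose rows and columns each have sum 0. Define
$\tilde{A}$ to be the submatrix formed from $A$ by deleting the last row and column. Let $\{f^{(i,j)}\}_{i,j=1}^{k}$ be
the orthonormal basis defined in the statement of Lemma 6. Then,

(a) $(\operatorname{vec}A)^Tf^{(i,k)} = (\operatorname{vec}A)^Tf^{(k,j)} = 0$ for $1 \le i,j \le k$,

(b) $\displaystyle\sum_{i=1}^{k-1}\sum_{j=1}^{k-1}\big((\operatorname{vec}A)^Tf^{(i,j)}\big)^2 = \sum_{i=1}^{k}\sum_{j=1}^{k} a_{ij}^2$,

(c) $\displaystyle(\operatorname{vec}\tilde{A})^T(J_{k-1}+I_{k-1})^{\otimes 2}\operatorname{vec}\tilde{A} = \sum_{i=1}^{k}\sum_{j=1}^{k} a_{ij}^2$.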

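The proof of Lemma 7 is given later in this section. We now proceed with the asymptotic
calculations for $M \in \mathcal{M}\cap(k/n)\mathbb{Z}^{k^2}\cap B_k(\epsilon\log n/n^{1/2})$. So define the matrix $A = A(M) =
[a_{p,q}]$ by $A = M - (1/k)J_k$. Note that each row and column of $A$ must have sum 0, and
moreover $|a_{p,q}| < \epsilon\log n/n^{1/2}$ for each $p,q \in \{1,\dots,k\}$. Let $\tilde{A}$ be the submatrix formed
from $A$ by deleting the last row and column. Under the new assumptions, we first derive a
new asymptotic formula for the expression computed in (3.5). For $m_{p,q} = 1/k + a_{p,q}$ with
$a_{p,q} = O(\log n/n^{1/2})$ we have

\[ \frac{\xi(dm_{p,q}n/k)}{\xi(m_{p,q}n/k)} \sim \frac{\sqrt{2\pi dm_{p,q}n/k}}{\sqrt{2\pi m_{p,q}n/k}} = \sqrt{d} \]

for $1 \le p,q \le k$, and we expand

\begin{align*}
\sum_{1\le p,q\le k} m_{p,q}\log m_{p,q}
&= \sum_{1\le p,q\le k}\Big(\frac1k + a_{p,q}\Big)\log\Big(\frac1k + a_{p,q}\Big) \\
&= \sum_{1\le p,q\le k}\Big(\frac1k + a_{p,q}\Big)\Big(-\log k + ka_{p,q} - \frac{k^2}{2}a_{p,q}^2 + O(a_{p,q}^3)\Big) \\
&= k\Big(-\log k + \frac12\sum_{1\le p,q\le k} a_{p,q}^2 + O(\log^3 n/n^{3/2})\Big) \\
&= k\Big(-\log k + \frac12(\operatorname{vec}\tilde{A})^T(J_{k-1}+I_{k-1})^{\otimes 2}\operatorname{vec}\tilde{A} + O(\log^3 n/n^{3/2})\Big),
\end{align*}

where we used Lemma 7(c) to rewrite $\sum a_{p,q}^2$ in the final step. Combining these estimates
we can rewrite (3.5) as

\[ \prod_{1\le p,q\le k}\frac{(dm_{p,q}n/k)!}{(m_{p,q}n/k)!} \sim d^{\,k^2/2+dn}\left(\frac{n}{ek^2}\right)^{(d-1)n}\exp\Big(\frac{(d-1)n}{2}(\operatorname{vec}\tilde{A})^T(J_{k-1}+I_{k-1})^{\otimes 2}\operatorname{vec}\tilde{A}\Big). \tag{3.10} \]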

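Next we rewrite the inner sum in (3.4) in terms of the natural generating function, letting
square brackets denote the extraction of a coefficient:

\begin{align*}
\sum_{B(M,n)}\ \prod_{\substack{1\le p,q,r,s\le k\\ p<r,\ q\ne s}}\frac{1}{b_{pqrs}!}
&= \Big[\prod_{1\le p,q\le k} x_{p,q}^{dm_{p,q}n/k}\Big]\prod_{\substack{1\le p,q,r,s\le k\\ p<r,\ q\ne s}}\ \sum_{i=0}^{\infty}\frac{(x_{p,q}x_{r,s})^i}{i!} \\
&= \Big[\prod_{1\le p,q\le k} x_{p,q}^{dm_{p,q}n/k}\Big]\exp\Big(\frac12\sum_{\substack{1\le p,q,r,s\le k\\ p\ne r,\ q\ne s}} x_{p,q}x_{r,s}\Big).
\end{align*}

The following result provides an asymptotic characterisation of that coefficient. The proof
uses the saddlepoint method, and can be found at the end of the section. Note that our
still-unspecified $\epsilon$ is determined by the statement.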

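Lemma 8 There exists $\epsilon > 0$ such that for each $M \in \mathcal{M}\cap(k/n)\mathbb{Z}^{k^2}\cap B_k(\epsilon\log n/n^{1/2})$ the
coefficient $C$ of $\prod_{1\le p,q\le k} x_{p,q}^{dm_{p,q}n/k}$ in the generating function $\exp\big(\frac12\sum_{p\ne r,\ q\ne s} x_{p,q}x_{r,s}\big)$
satisfies

\[ C \sim \gamma(k)\left(\frac{1}{dn}\right)^{k^2/2}\left(\frac{k^2(k-1)^2}{dn/e}\right)^{dn/2}\exp\left(\frac{-nd(k-1)^2\,(\operatorname{vec}\tilde{A})^T(J_{k-1}+I_{k-1})^{\otimes 2}\operatorname{vec}\tilde{A}}{2(k^2-2k+2)}\right), \]

where $\gamma(k)$ is the constant

\[ \gamma(k) = \frac{k^{k^2}(k-1)^{k(k-1)}}{(2\pi)^{k^2/2}\,2^{-1/2}\,(k^2-2k+2)^{(k-1)^2/2}\,(k-2)^{k-1}}. \]

Hence, in view of (3.10) and Lemma 8, we obtain for $M \in \mathcal{M}\cap(k/n)\mathbb{Z}^{k^2}\cap B_k(\epsilon\log n/n^{1/2})$
an asymptotic expression for (3.4) which improves the bound already stated in (3.9):

\[ \frac{|T(M)|}{|\mathcal{P}_{n,d}|} \sim \sqrt{\pi}\,\gamma(k)\,\frac{(k-1)^{dn}}{n^{k^2/2-1/2}\,k^{(d-2)n}}\exp\left(-n\,\frac{k^2-2k-d+2}{2(k^2-2k+2)}\,(\operatorname{vec}\tilde{A})^T(J_{k-1}+I_{k-1})^{\otimes 2}\operatorname{vec}\tilde{A}\right), \tag{3.11} \]

where $\gamma(k)$ is the constant defined in Lemma 8.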

We are now in good shape to estimate the sum (3.1). We begin by computing the
contribution of the terms near $(1/k)J_k$. For a $(k-1)$-by-$(k-1)$ matrix $M$ define $\overline{M}$ to be the
$k$-by-$k$ matrix formed from $M$ by adding a new row and column so that every row sum and
column sum is 1. Recall the definition of $B_p(\delta)$ and define $\overline{B}_p(\delta) = \{\overline{M} \mid M \in B_p(\delta)\}$. Now
we set

\[ \delta = \frac{\epsilon\log n}{(k-1)^2n^{1/2}}, \]

and consider $M = \overline{M'}$ for $M' \in B_{k-1}(\delta)\cap\frac{k}{n}\mathbb{Z}^{(k-1)^2}$. A straightforward application of the
triangle inequality shows that $\overline{B}_{k-1}(\delta) \subset B_k((k-1)^2\delta)$, and therefore $M \in B_k(\epsilon\log n/n^{1/2})$.
So $M$ is nonnegative (for large enough $n$) and hence $M \in \mathcal{M}$. Furthermore, the entries of $M$
are in $\frac{k}{n}\mathbb{Z}$ because they are integer linear combinations of $k/n$ and $1 = \frac{n}{k}\times\frac{k}{n}$, using the fact
that $k$ divides $n$. This shows that

\[ \frac{|T(M)|}{|\mathcal{P}_{n,d}|} \]

is a term in the sum (3.1), suggesting that we express (3.1) as $E(Y^2) = S_1 + S_2$ where

\[ S_1 = \sum_{M' \in B_{k-1}(\delta)\,\cap\,\frac{k}{n}\mathbb{Z}^{(k-1)^2}} \frac{|T(\overline{M'})|}{|\mathcal{P}_{n,d}|} \]

and $S_2$ is the sum of the remaining terms. Notice moreover that if $M' \in B_{k-1}(\delta)\cap\frac{k}{n}\mathbb{Z}^{(k-1)^2}$
and $M = \overline{M'}$, then the matrix $A = M - (1/k)J_k$ has all entries $|a_{p,q}| < \epsilon n^{-1/2}\log n$. Hence the
expansion given in (3.11) is valid, and we can express

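\[ S_1 \sim \sqrt{\pi}\,\gamma(k)\,\frac{(k-1)^{dn}}{n^{k^2/2-1/2}\,k^{(d-2)n}}\sum_{M' \in B_{k-1}(\delta)\,\cap\,\frac{k}{n}\mathbb{Z}^{(k-1)^2}} e^{nf(M')}, \tag{3.12} \]

where

\[ f(M') = -\frac{k^2-2k-d+2}{2(k^2-2k+2)}\,(\operatorname{vec}\tilde{A})^T(J_{k-1}+I_{k-1})^{\otimes 2}\operatorname{vec}\tilde{A}, \]

and $\tilde{A} = M' - (1/k)J_{k-1}$. By iterating the Euler-Maclaurin summation formula (see [1],
p. 806), we have

\[ \sum_{M' \in B_{k-1}(\delta)\,\cap\,\frac{k}{n}\mathbb{Z}^{(k-1)^2}} e^{nf(M')} \sim \Big(\frac{n}{k}\Big)^{(k-1)^2}\int_{B_{k-1}(\delta)} e^{nf(M')}\,dM'. \tag{3.13} \]

Setting $H = \frac{k^2-2k-d+2}{k^2-2k+2}(J_{k-1}+I_{k-1})^{\otimes 2}$, so that $f(M') = -\frac12(\operatorname{vec}\tilde{A})^TH\operatorname{vec}\tilde{A}$, and using
Lemma 6 (b) for the eigenvalues of $(J_{k-1}+I_{k-1})^{\otimes 2}$, we deduce that $H$ is positive definite and has determinant

\[ \det H = \left(\frac{k^2-2k-d+2}{k^2-2k+2}\right)^{(k-1)^2}k^{2(k-1)}. \]

Here we used the fact that $k^2-2k-d+2 > 0$, which is guaranteed from our assumptions on
$d$ and $k$. Now we may apply Lemma 4 to conclude

\begin{align*}
S_1 &\sim \sqrt{\pi}\,\gamma(k)\,\frac{(k-1)^{dn}}{n^{k^2/2-1/2}\,k^{(d-2)n}}\Big(\frac{n}{k}\Big)^{(k-1)^2}\frac{(2\pi/n)^{(k-1)^2/2}}{|\det H|^{1/2}} \\
&= \frac{k^{k}(k-1)^{k(k-1)}}{(k^2-2k-d+2)^{(k-1)^2/2}\,(2\pi(k-2))^{k-1}}\; n^{-(k-1)}\,k^{2n}\Big(1-\frac1k\Big)^{dn}. \tag{3.14}
\end{align*}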

To prove the proposition it suffices to show $S_2 = o(S_1)$. Let $M$ be an index of any term of
$S_2$. This implies $M \notin \overline{B}_{k-1}(\delta)$, so we must have $M \in \mathcal{M}\setminus B_k(\delta)$ since $B_k(\delta)\cap\mathcal{M} \subseteq \overline{B}_{k-1}(\delta)$.
Recall now the definition of $\varphi$ in (3.8), and obtain by direct substitution

\[ \varphi\Big(\frac1kJ_k\Big) = \log k + d\log\Big(1-\frac1k\Big). \]

From Theorem 7 in [3] (see also (5) in the same paper) we have that if $d < d_{k-1} := 2(k-1)\log(k-1)$
then for each $M \in \mathcal{M}$,



4 E^f'^'-i 



k J A{k 



1 \ dk_i — d 2 



In particular, for each $M \in \mathcal{M}\setminus B_k(\delta)$,

^(M) < log^ + dlog (^1 - - J - (3.15) 

By combining (3.15) with the bound on the general term obtained in (3.9) and also taking
into account that the number of terms in $S_2$ is at most $O(n^{k^2})$, we conclude

1 \ / 4-1 - d 



S2 = 0{nnpolj{n)k'- ( 1 - -J exp j^-^^i—^dog^ n 



< poly'(n)A;2" (^1 - 1^ 



dn 

2« ( 1 _ i ^ ^-G(logn) 



for some polynomial $\mathrm{poly}'(n)$. Thus $S_2 = o(S_1)$, and this completes the proof. ∎

It only remains to prove Lemmas 5, 7 and 8.
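Proof of Lemma 5. Let $\mathcal{L} = \mathcal{L}(M)$ be the polytope consisting of all non-negative tuples
$L = (\ell_{pqrs})_{1 \le p,q,r,s \le k,\; p < r,\; q \ne s}$ in $\mathbb{R}^{k^2(k-1)^2/2}$ such that

\[ \sum_{\substack{1 \le r,s \le k \\ r \ne p,\; s \ne q}} \ell_{pqrs} = d\, m_{p,q} \qquad \forall p,q \in \{1,\ldots,k\}, \tag{3.16} \]

where for $p > r$ and $q \ne s$ we used the duplicate notation $\ell_{pqrs} = \ell_{rspq}$ to denote the
coordinates. For each $L \in \mathcal{L}$, define

\[ \psi(L) = \frac{\prod_{1 \le p,q \le k} m_{p,q}^{\,d m_{p,q}}}{\prod_{p < r,\; q \ne s} \ell_{pqrs}^{\,\ell_{pqrs}}}. \]

Our aim is to show that for all $L \in \mathcal{L}$

\[ \psi(L) \le \left( \frac{1}{dk} \sum_{\substack{1 \le p,q,r,s \le k \\ p \ne r,\; q \ne s}} m_{p,q}\, m_{r,s} \right)^{dk/2}. \tag{3.17} \]

This is indeed equivalent to the statement of the lemma, after setting $b_{pqrs} = \ell_{pqrs}\, n/k$ and
performing straightforward manipulations.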

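As a first case, assume that $m_{p,q} > 0$ for all $p, q$ in $\{1, \ldots, k\}$. We define $\widehat{\mathcal{L}}$ to be the
polytope of all non-negative tuples $L = (\ell_{pqrs})_{1 \le p,q,r,s \le k,\; p < r,\; q \ne s}$ in $\mathbb{R}^{k^2(k-1)^2/2}$ such that

\[ \sum_{p < r,\; q \ne s} \ell_{pqrs} = dk/2, \tag{3.18} \]

and for each $L \in \widehat{\mathcal{L}}$, let

\[ \widehat{\psi}(L) = \prod_{\substack{1 \le p,q,r,s \le k \\ p < r,\; q \ne s}} \left( \frac{m_{p,q}\, m_{r,s}}{\ell_{pqrs}} \right)^{\ell_{pqrs}}. \]

Observe that $\mathcal{L} \subset \widehat{\mathcal{L}}$ and moreover the restriction of $\widehat{\psi}$ to $\mathcal{L}$ is equal to $\psi$. Our goal is to
maximise $\log \widehat{\psi}$ over $\widehat{\mathcal{L}}$, and thus provide a bound on $\psi$ over $\mathcal{L}$. We first show that $\log \widehat{\psi}$ does
not maximise on the boundary. Note that the boundary of $\widehat{\mathcal{L}}$ consists of points having some zero
coordinate (but at least some coordinate must be strictly positive). Let us choose an arbitrary
boundary point $L_0$, and assume without loss of generality that $\ell_{1122} = 0$ and $\ell_{1221} > 0$. Let $L_\epsilon$
be the point with the same coordinates as $L_0$ but replacing $\ell_{1122}$ by $\epsilon$ and $\ell_{1221}$ by $\ell_{1221} - \epsilon$.
For small enough $\epsilon > 0$, $L_\epsilon$ lies in $\widehat{\mathcal{L}}$, and moreover

\[ \lim_{\epsilon \to 0^+} \frac{d}{d\epsilon} \log \widehat{\psi}(L_\epsilon) = +\infty, \]

so the maximum cannot occur at $L_0$. Hence, $\log \widehat{\psi}$ is maximised in the interior of $\widehat{\mathcal{L}}$ and the
maximiser(s) must satisfy the following Lagrange equations

\[ \log(m_{p,q}\, m_{r,s}) - \log \ell_{pqrs} - 1 = \lambda \qquad \forall p,q,r,s \in \{1,\ldots,k\},\; p < r,\; q \ne s. \]

These are equivalent to

\[ \ell_{pqrs} = m_{p,q}\, m_{r,s}\, e^{-1-\lambda} \qquad \forall p,q,r,s \in \{1,\ldots,k\},\; p < r,\; q \ne s, \]

which combined with (3.18) yield

\[ \ell_{pqrs} = dk\, \frac{m_{p,q}\, m_{r,s}}{\sum_{p' \ne r',\; q' \ne s'} m_{p',q'}\, m_{r',s'}} \qquad \forall p,q,r,s \in \{1,\ldots,k\},\; p < r,\; q \ne s. \]

From the uniqueness of the solution, we deduce that it must be the maximiser of $\log \widehat{\psi}$.
The value of $\widehat{\psi}$ at this point can be easily computed by substitution:

\[ \left( \frac{1}{dk} \sum_{p \ne r,\; q \ne s} m_{p,q}\, m_{r,s} \right)^{dk/2}, \]

which proves the bound in (3.17) under the assumption that $m_{p,q} > 0$ for all $p$ and $q$. To
extend the argument to the other cases, we first define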

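\[ \mathcal{V} = \{ (M, L) \mid M \in \mathcal{M},\; L \in \mathcal{L}(M) \}, \]

and with a mild abuse of notation denote by

\[ \psi(M, L) = \frac{\prod_{1 \le p,q \le k} m_{p,q}^{\,d m_{p,q}}}{\prod_{p < r,\; q \ne s} \ell_{pqrs}^{\,\ell_{pqrs}}} \]

the natural extension of $\psi$ to $\mathcal{V}$. Notice that $\psi$ is continuous in $\mathcal{V}$. We showed so far that,
for any $M$ in the interior of $\mathcal{M}$ (i.e. $m_{p,q} > 0$) and any $L \in \mathcal{L}(M)$,

\[ \psi(M, L) \le \left( \frac{1}{dk} \sum_{\substack{1 \le p,q,r,s \le k \\ p \ne r,\; q \ne s}} m_{p,q}\, m_{r,s} \right)^{dk/2}. \]

Hence, this inequality can be extended by continuity to any $M$ on the boundary of $\mathcal{M}$, and
thus to any $(M, L) \in \mathcal{V}$. This concludes the proof of (3.17). ∎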
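
The Lagrange computation in the proof of Lemma 5 can be checked numerically. The following is a minimal sketch, not part of the original argument: it assumes the relaxed objective is the product of (m_{p,q} m_{r,s} / l_{pqrs})^{l_{pqrs}} over coordinates with p < r and q != s, subject to the single constraint that the coordinates sum to dk/2. The function names and the random test data are hypothetical and chosen only for illustration.

```python
import math
import random

def index_set(k):
    """All coordinates (p, q, r, s) with p < r and q != s (0-based)."""
    return [(p, q, r, s)
            for p in range(k) for r in range(p + 1, k)
            for q in range(k) for s in range(k) if q != s]

def log_psi_hat(m, ell, idx):
    """log of prod (m_pq * m_rs / ell_pqrs) ** ell_pqrs, with 0 * log(0) = 0."""
    total = 0.0
    for (p, q, r, s), x in zip(idx, ell):
        if x > 0:
            total += x * math.log(m[p][q] * m[r][s] / x)
    return total

def stationary_point(m, d, idx):
    """Solution of the Lagrange equations: ell proportional to m_pq * m_rs."""
    k = len(m)
    weights = [m[p][q] * m[r][s] for (p, q, r, s) in idx]
    scale = (d * k / 2) / sum(weights)
    return [w * scale for w in weights]

def log_bound(m, d, idx):
    """log of the right-hand side of (3.17); the ordered sum is twice the p < r sum."""
    k = len(m)
    half_sum = sum(m[p][q] * m[r][s] for (p, q, r, s) in idx)
    return (d * k / 2) * math.log(2 * half_sum / (d * k))

def random_positive_matrix(k, rng):
    """Random positive k x k matrix normalised so that its entries sum to k."""
    raw = [[rng.uniform(0.2, 1.0) for _ in range(k)] for _ in range(k)]
    total = sum(map(sum, raw))
    return [[k * x / total for x in row] for row in raw]

def random_feasible_point(d, k, idx, rng):
    """Random non-negative tuple whose coordinates sum to d*k/2."""
    raw = [rng.random() for _ in idx]
    scale = (d * k / 2) / sum(raw)
    return [x * scale for x in raw]
```

On random inputs the stationary point attains the bound up to rounding, and random feasible points stay below it, which is what the concavity of the objective predicts.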

Proof of Lemma 7. We begin by proving (a). Let $j \in \{1, 2, \ldots, k\}$. Since $f^{(k,j)} = f^{(k)} \otimes f^{(j)}$
is a linear combination of terms of the form $\mathbf{1} \otimes \mathbf{1}^{(q)} = \sum_{p=1}^{k} (\mathbf{1}^{(p)} \otimes \mathbf{1}^{(q)})$, $(1 \le q \le k)$, we have
that $(\mathrm{vec} A)^T f^{(k,j)}$ is a linear combination of terms of the form $\sum_{p=1}^{k} (\mathrm{vec} A)^T (\mathbf{1}^{(p)} \otimes \mathbf{1}^{(q)}) = \sum_{p=1}^{k} a_{q,p} = 0$,
since the row sums of $A$ are 0. A similar argument shows $(\mathrm{vec} A)^T f^{(i,k)} = 0$ for
$i \in \{1, 2, \ldots, k\}$ using the fact that the column sums of $A$ equal 0.
To prove (b) we apply (a) to write

\[ \sum_{i=1}^{k-1} \sum_{j=1}^{k-1} \big( (\mathrm{vec} A)^T f^{(i,j)} \big)^2 = \sum_{i=1}^{k} \sum_{j=1}^{k} \big( (\mathrm{vec} A)^T f^{(i,j)} \big)^2, \]

which is the sum of the squares of the coordinates of $\mathrm{vec} A$ in the basis given by $\{f^{(i,j)}\}_{i,j=1}^{k}$.
Since the basis is orthonormal, this expression is simply the square of the norm of $\mathrm{vec} A$ with
respect to the standard basis, $\sum_{i=1}^{k} \sum_{j=1}^{k} a_{i,j}^2$.
To prove part (c) we begin by writing 

k k 

EE"?. 

i=i j=i 

k-l k-1 k-1 k-1 

1=1 j=l i=l j=l 

'k-1 k-1 ^2 I, 1 / 1 \ 2 

EE 

j=i j=i
Since



EE«^-^) = (veci)^Jf ,veci, 
i=i j=i J 

2 

= {vecAy{Jk-i ® 4_i)veci, 
= (vecA)'^(Jfc_i (8) Jjt_i)vec^, 



EE«l-(veci)^/fivec^, 

i=i i=i 

and $(J_{k-1} + I_{k-1})^{\otimes 2} = J_{k-1}^{\otimes 2} + (J_{k-1} \otimes I_{k-1}) + (I_{k-1} \otimes J_{k-1}) + I_{k-1}^{\otimes 2}$, part (c) is proved. ∎
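
The identity behind part (c) can be tested with exact integer arithmetic. The sketch below is an illustration under a stated assumption: it reads part (c) as saying that, for a k x k matrix with zero row and column sums, the sum of all squared entries equals the quadratic form of $(J_{k-1} + I_{k-1})^{\otimes 2}$ evaluated at the vectorised top-left (k-1) x (k-1) block. The helper names are hypothetical.

```python
import random

def kron(a, b):
    """Kronecker product of two square matrices given as lists of lists."""
    n, m = len(a), len(b)
    return [[a[i // m][j // m] * b[i % m][j % m] for j in range(n * m)]
            for i in range(n * m)]

def quadratic_form(v, mat):
    """Compute v^T * mat * v."""
    return sum(v[i] * mat[i][j] * v[j]
               for i in range(len(v)) for j in range(len(v)))

def zero_sum_matrix(k, rng):
    """Random integer k x k matrix whose row and column sums are all 0."""
    a = [[rng.randint(-5, 5) for _ in range(k - 1)] for _ in range(k - 1)]
    for row in a:
        row.append(-sum(row))          # last column fixes the row sums
    a.append([-sum(a[i][j] for i in range(k - 1)) for j in range(k)])
    return a

def check_part_c(k, rng):
    """True if sum of all a_ij^2 equals the quadratic form on the truncated block."""
    a = zero_sum_matrix(k, rng)
    n = k - 1
    j_plus_i = [[1 + int(i == j) for j in range(n)] for i in range(n)]
    vec = [a[i][j] for i in range(n) for j in range(n)]   # truncated block, row by row
    lhs = sum(x * x for row in a for x in row)
    rhs = quadratic_form(vec, kron(j_plus_i, j_plus_i))
    return lhs == rhs
```

Because the Kronecker square is symmetric in its two factors, the check does not depend on whether the block is vectorised by rows or by columns.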

Before proceeding with the proof of Lemma 8, we need the following technical result. 

Lemma 9 Let $\delta \in (0, 2\pi/5)$ and fix an integer $k \ge 3$. For each $1 \le p,q \le k$, let $-\pi < \theta_{p,q} \le \pi$.
Suppose $\max_{p,q} |\theta_{p,q}| > \delta$ and $\min_{p,q} |\theta_{p,q}| < \pi - \delta$. Then there exist $p$, $q$, $r$, and $s$ with
$p \ne r$ and $q \ne s$ such that

\[ \frac{\delta}{2} \le |\theta_{p,q} + \theta_{r,s}| \le 2\pi - \frac{\delta}{2}. \]

Proof. There are two cases. In the first case, suppose $\delta < |\theta_{p,q}| < \pi - \delta$ for some $p$ and $q$.
Let $S$ be the set of pairs

\[ S = \{ (r, s) \mid 1 \le r \le k,\; r \ne p,\; 1 \le s \le k,\; s \ne q \}. \]

The set $S$ is nonempty as $k \ge 2$. If there exists $(r, s) \in S$ with $|\theta_{p,q} + \theta_{r,s}| \ge \delta/2$ then

\[ \frac{\delta}{2} \le |\theta_{p,q} + \theta_{r,s}| \le |\theta_{p,q}| + |\theta_{r,s}| < \pi - \delta + \pi < 2\pi - \frac{\delta}{2} \]

and we are finished. Otherwise, all $\theta_{r,s}$ with $(r, s) \in S$ satisfy $|\theta_{p,q} + \theta_{r,s}| < \delta/2$; i.e. they
are all within $\delta/2$ units of $-\theta_{p,q}$, and so, because $\delta < |\theta_{p,q}| < \pi - \delta$, they all have the same
sign and satisfy $\delta/2 < |\theta_{r,s}| < \pi - \delta/2$. Now let $(r, s) \in S$ and choose any $(t, u)$ with
$t \in \{1, 2, \ldots, k\} \setminus \{p, r\}$ and $u \in \{1, 2, \ldots, k\} \setminus \{q, s\}$. This is possible because $k \ge 3$. Since
$(t, u) \in S$ we have, using the above observations, $\delta < |\theta_{r,s} + \theta_{t,u}| < 2\pi - \delta$, which implies the
required result.

For the remaining case, we must have $|\theta_{p,q}| \in [0, \delta] \cup [\pi - \delta, \pi]$ for all $1 \le p,q \le k$. We
claim there exist $p, q, r, s$ with $p \ne r$, $q \ne s$, $|\theta_{p,q}| \in [0, \delta]$, and $|\theta_{r,s}| \in [\pi - \delta, \pi]$. If we prove
the claim then we are finished because

\[ \frac{\delta}{2} < \pi - 2\delta \le \big| |\theta_{p,q}| - |\theta_{r,s}| \big| \le |\theta_{p,q} + \theta_{r,s}| \le |\theta_{p,q}| + |\theta_{r,s}| \le \delta + \pi < 2\pi - \frac{\delta}{2}. \]

Assume for contradiction that the claim is false. By the hypothesis of the lemma there
exist $p$ and $q$ with $|\theta_{p,q}| < \pi - \delta$. Since $|\theta_{p,q}| \in [0, \delta] \cup [\pi - \delta, \pi]$ for every $1 \le p,q \le k$ we must
have $|\theta_{p,q}| \in [0, \delta]$. Since we are assuming that the claim is false, we must have $|\theta_{r,s}| \in [0, \delta]$
for the $(k-1)^2$ pairs $(r, s)$ with $r \ne p$ and $s \ne q$. But the hypothesis of the lemma also
gives us $(t, u)$ with $|\theta_{t,u}| > \delta$, so an argument analogous to the previous one shows there must
exist $(k-1)^2$ pairs $(v, w)$ with $|\theta_{v,w}| \in [\pi - \delta, \pi]$. The two ranges are disjoint, so counting
$(p, q)$ and $(t, u)$ as well gives at least $2(k-1)^2 + 2$ distinct pairs. Since $2(k-1)^2 + 2$ exceeds $k^2$, the
total number of ordered pairs, we have a contradiction, as required. ∎
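
Lemma 9 is a finite statement about a k x k array of angles, so it lends itself to a randomised sanity check. The sketch below is an illustration only, not a proof: it samples angle arrays biased towards the extreme ranges near 0 and pi, and counts cases where the hypotheses hold but no admissible quadruple exists. The sampling scheme and all function names are assumptions made for this example.

```python
import math
import random

def find_witness(theta, delta):
    """Return (p, q, r, s) with p != r, q != s and
    delta/2 <= |theta[p][q] + theta[r][s]| <= 2*pi - delta/2, or None."""
    k = len(theta)
    for p in range(k):
        for q in range(k):
            for r in range(k):
                for s in range(k):
                    if p != r and q != s:
                        total = abs(theta[p][q] + theta[r][s])
                        if delta / 2 <= total <= 2 * math.pi - delta / 2:
                            return (p, q, r, s)
    return None

def hypotheses_hold(theta, delta):
    """max |theta| > delta and min |theta| < pi - delta."""
    values = [abs(x) for row in theta for x in row]
    return max(values) > delta and min(values) < math.pi - delta

def random_angles(k, delta, rng):
    """k x k array of angles in (-pi, pi], biased towards values near 0 and near pi."""
    def one():
        mode = rng.randrange(3)
        if mode == 0:
            x = rng.choice((-1.0, 1.0)) * rng.uniform(0.0, delta)
        elif mode == 1:
            x = rng.choice((-1.0, 1.0)) * rng.uniform(math.pi - delta, math.pi)
        else:
            x = rng.uniform(-math.pi, math.pi)
        return math.pi if x <= -math.pi else x
    return [[one() for _ in range(k)] for _ in range(k)]

def count_failures(k, trials, seed=0):
    """Number of sampled arrays satisfying the hypotheses but admitting no witness."""
    rng = random.Random(seed)
    failures = 0
    for _ in range(trials):
        delta = rng.uniform(0.01, 2 * math.pi / 5 - 0.01)
        theta = random_angles(k, delta, rng)
        if hypotheses_hold(theta, delta) and find_witness(theta, delta) is None:
            failures += 1
    return failures
```

A call such as `count_failures(3, 3000)` is expected to return 0 if the lemma holds; a nonzero count would point to a counterexample worth inspecting.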

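Proof of Lemma 8. We use the saddlepoint method to estimate the coefficient $C$ in the
statement. Using Cauchy's integral formula, $C$ can be written in terms of an integral around
the product of circles $z_{p,q} = \rho_{p,q} \exp(i\theta_{p,q})$, $-\pi < \theta_{p,q} \le \pi$, $(1 \le p,q \le k)$, as follows,

\[ C = \frac{1}{(2\pi i)^{k^2}} \oint \frac{\exp\big( \frac{1}{2} \sum_{p \ne r,\, q \ne s} z_{p,q} z_{r,s} \big)}{\prod_{p,q} z_{p,q}^{\,d m_{p,q} n/k + 1}} \prod_{p,q} dz_{p,q}
  = \frac{1}{(2\pi)^{k^2} \prod_{p,q} \rho_{p,q}^{\,d m_{p,q} n/k}} \int \frac{\exp\big( \frac{1}{2} \sum_{p \ne r,\, q \ne s} \rho_{p,q} \rho_{r,s} e^{i(\theta_{p,q} + \theta_{r,s})} \big)}{\exp\big( i \sum_{p,q} \theta_{p,q}\, d m_{p,q} n/k \big)} \prod_{p,q} d\theta_{p,q}. \tag{3.19} \]

Viewing $\theta = \mathrm{vec}([\theta_{p,q}])$ as a $k^2$-dimensional vector, let $g(\theta)$ denote the integrand in the above
expression. Consider

\[ g(\theta + \pi \mathbf{1}) = \frac{\exp\big( \frac{1}{2} \sum_{p \ne r,\, q \ne s} \rho_{p,q} \rho_{r,s} e^{i(\theta_{p,q} + \theta_{r,s} + 2\pi)} \big)}{\exp\big( i \sum_{p,q} \theta_{p,q}\, d m_{p,q} n/k + i\pi \sum_{p,q} d m_{p,q} n/k \big)}
  = \frac{\exp\big( \frac{1}{2} \sum_{p \ne r,\, q \ne s} \rho_{p,q} \rho_{r,s} e^{i(\theta_{p,q} + \theta_{r,s})} \big)}{\exp\big( i \sum_{p,q} \theta_{p,q}\, d m_{p,q} n/k + i\pi d n \big)} = g(\theta), \]

which holds since $\sum_{p,q} m_{p,q} = k$ and $dn$ is even. Setting $\delta = \log n / \sqrt{n}$, this tells us that the
integral over the region $\{\theta \mid |\theta_{p,q}| \le \delta \text{ for } 1 \le p,q \le k\}$ equals the integral over the region
$\{\theta \mid \pi - \delta \le |\theta_{p,q}| \le \pi \text{ for } 1 \le p,q \le k\}$. Set each $\rho_{p,q}$ to be the common value

\[ \rho_{p,q} = \rho = \frac{\sqrt{dn}}{k(k-1)}. \]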

We will see that the integral over each of these regions is asymptotic to 

^ .^.k^n k^\k - i)^(^-^)exp 

^ ^/2 \dn) (A;2-2A; + 2)(^-i)V2(A;- 2)^-1

> Ke^pl^-e'lo^T^ (3.20)
(using $a_{p,q} \le \epsilon \log n / n^{1/2}$) where $K$ and $\epsilon'$ are constants depending on $\epsilon$. We will also show
that the integral over the remaining region is $o(I)$. The lemma then follows by combining
these two facts with (3.19).

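To see that the integral over the remaining region is $o(I)$, let $\theta$ be any vector in this region.
By the definition of this region we must have $\min_{p,q} |\theta_{p,q}| < \pi - \delta$ and $\max_{p,q} |\theta_{p,q}| > \delta$. By
Lemma 9 there exist $p^*, q^*, r^*, s^* \in \{1, \ldots, k\}$ with $p^* \ne r^*$ and $q^* \ne s^*$ such that

\[ \frac{\delta}{2} \le |\theta_{p^*,q^*} + \theta_{r^*,s^*}| \le 2\pi - \frac{\delta}{2}. \]

Now

\[ \cos(\theta_{p^*,q^*} + \theta_{r^*,s^*}) \le \cos\Big( \frac{\delta}{2} \Big) = 1 - \frac{\delta^2}{8} + O(\delta^4), \]

so the absolute value of the integrand is

\[ |g(\theta)| = \exp\Big( \frac{1}{2} \rho^2 \sum_{p \ne r,\, q \ne s} \cos(\theta_{p,q} + \theta_{r,s}) \Big)
  \le \exp\Big( \frac{1}{2} \rho^2 \Big( k^2(k-1)^2 - 1 + 1 - \frac{\delta^2}{8} + O(\delta^4) \Big) \Big) \]
\[ = \exp\Big( \frac{1}{2} \rho^2 k^2 (k-1)^2 - \frac{\rho^2 \delta^2}{16} + O\big( \rho^2 n^{-2} (\log n)^4 \big) \Big)
  = \exp\Big( \frac{dn}{2} - \frac{d (\log n)^2}{16 k^2 (k-1)^2} + O(1) \Big), \]

recalling that we chose $\rho = k^{-1}(k-1)^{-1}\sqrt{dn}$ and $\delta = \log n / \sqrt{n}$. Hence $|g(\theta)| = o(I)$ if we
choose $\epsilon$ sufficiently small so that the constant $\epsilon'$ in (3.20) satisfies $\epsilon' < d / (16 k^2 (k-1)^2)$.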
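It remains to show that

\[ \int_{[-\delta,\delta]^{k^2}} g(\theta)\, d\theta \sim I. \]

For $\theta \in [-\delta, \delta]^{k^2}$ we have

\[ \log g(\theta) = \frac{\rho^2}{2} \sum_{p \ne r,\, q \ne s} \Big( 1 + i(\theta_{p,q} + \theta_{r,s}) - \frac{(\theta_{p,q} + \theta_{r,s})^2}{2} + O(|\theta|^3) \Big) - i \frac{dn}{k} \sum_{p,q} \theta_{p,q}\, m_{p,q}. \]

Regrouping the terms and substituting $m_{p,q} = k^{-1} + a_{p,q}$ this becomes

\[ \log g(\theta) = \frac{\rho^2}{2} k^2 (k-1)^2 + i \Big( \rho^2 (k-1)^2 - \frac{dn}{k^2} \Big) \sum_{p,q} \theta_{p,q} - i \frac{dn}{k} \sum_{p,q} a_{p,q} \theta_{p,q}
  - \frac{\rho^2}{2} \Big( (k-1)^2 \sum_{p,q} \theta_{p,q}^2 + \sum_{p \ne r,\, q \ne s} \theta_{p,q} \theta_{r,s} \Big) + O(\rho^2 \delta^3). \]

Let $c$ be the constant $c = d / (2 k^2 (k-1)^2)$. Recalling $\rho = k^{-1}(k-1)^{-1}\sqrt{dn}$, so that $\rho^2 (k-1)^2 = dn/k^2$
and the terms linear in $\sum_{p,q} \theta_{p,q}$ cancel, we find

\[ \log g(\theta) = \frac{dn}{2} - i \frac{dn}{k} (\mathrm{vec} A)^T \theta - c n\, \theta^T B\, \theta + O\big( n^{-1/2} (\log n)^3 \big) \tag{3.21} \]

where $B$ is the matrix

\[ B = (k-1)^2 I_{k^2} + (J_k - I_k)^{\otimes 2}. \]

Define $h(\theta) = -i (dn/k) (\mathrm{vec} A)^T \theta - c n\, \theta^T B\, \theta$. Lemma 6 gives us an orthonormal basis
$\{f^{(p,q)}\}_{p,q=1}^{k}$ of eigenvectors for $B$ and corresponding sequence of eigenvalues $(\lambda_{p,q})_{p,q=1}^{k}$.
Introduce the new variables $(\tau_{p,q})_{p,q=1}^{k}$ to perform the change of basis $\theta = \sum_{p,q} f^{(p,q)} \tau_{p,q}$. This
gives

\[ h = \sum_{p,q} \Big( -i \frac{dn}{k} (\mathrm{vec} A)^T f^{(p,q)}\, \tau_{p,q} - c n \lambda_{p,q} \tau_{p,q}^2 \Big). \]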
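
The spectrum of $B = (k-1)^2 I_{k^2} + (J_k - I_k)^{\otimes 2}$ can be confirmed with exact integer arithmetic. The sketch below is a sanity check under stated assumptions: it verifies that the three values $2(k-1)^2$, $(k-1)(k-2)$ and $k^2 - 2k + 2$ are eigenvalues on tensor products of the all-ones vector and vectors orthogonal to it, and that the quadratic part of the expansion of $\log g$ equals $\theta^T B \theta$ up to the factor 2. The specific test vectors are illustrative choices and are not the basis of Lemma 6.

```python
import random

def kron(a, b):
    """Kronecker product of two square matrices given as lists of lists."""
    n, m = len(a), len(b)
    return [[a[i // m][j // m] * b[i % m][j % m] for j in range(n * m)]
            for i in range(n * m)]

def build_b(k):
    """B = (k-1)^2 * I_{k^2} + (J_k - I_k) tensor (J_k - I_k), integer entries."""
    off = [[int(i != j) for j in range(k)] for i in range(k)]   # J_k - I_k
    b = kron(off, off)
    for i in range(k * k):
        b[i][i] += (k - 1) ** 2
    return b

def mat_vec(mat, v):
    return [sum(x * y for x, y in zip(row, v)) for row in mat]

def tensor(u, w):
    """Coordinates of u tensor w, ordered consistently with kron above."""
    return [x * y for x in u for y in w]

def eigen_checks(k):
    """Verify B v = lambda v on representative vectors (requires k >= 3)."""
    b = build_b(k)
    ones = [1] * k
    u = [1, -1] + [0] * (k - 2)       # orthogonal to the all-ones vector
    w = [0, 1, -1] + [0] * (k - 3)    # also orthogonal to the all-ones vector
    cases = [
        (tensor(ones, ones), 2 * (k - 1) ** 2),
        (tensor(ones, u), (k - 1) * (k - 2)),
        (tensor(u, ones), (k - 1) * (k - 2)),
        (tensor(u, w), k * k - 2 * k + 2),
    ]
    return all(mat_vec(b, v) == [lam * x for x in v] for v, lam in cases)

def quadratic_identity(k, rng):
    """Check: sum over p != r, q != s of (t_pq + t_rs)^2 equals 2 * t^T B t."""
    t = [[rng.randint(-3, 3) for _ in range(k)] for _ in range(k)]
    lhs = sum((t[p][q] + t[r][s]) ** 2
              for p in range(k) for q in range(k)
              for r in range(k) for s in range(k) if p != r and q != s)
    flat = [x for row in t for x in row]
    rhs = 2 * sum(x * y for x, y in zip(flat, mat_vec(build_b(k), flat)))
    return lhs == rhs
```

The eigenspaces have dimensions 1, 2(k-1) and (k-1)^2 respectively, which add up to k^2, so these three values exhaust the spectrum.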



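Let $p, q \in \{1, \ldots, k\}$. Using the identity

\[ \int_{-\infty}^{\infty} e^{iax - bx^2}\, dx = \sqrt{\frac{\pi}{b}}\, \exp\Big( -\frac{a^2}{4b} \Big) \]

(for $b > 0$), we have that $\int_{\mathbb{R}^{k^2}} \exp(h(\theta))\, d\theta$ is a product of terms of the form

\[ \sqrt{\frac{\pi}{c n \lambda_{p,q}}}\, \exp\Big( \frac{-d^2 n \big( (\mathrm{vec} A)^T f^{(p,q)} \big)^2}{4 c k^2 \lambda_{p,q}} \Big). \]

So by Lemma 4, for some constant $c' > 0$ we have

\[ \int_{[-\delta,\delta]^{k^2}} \exp(h(\theta))\, d\theta = \int_{\mathbb{R}^{k^2}} \exp(h(\theta))\, d\theta + O\big( e^{-c' (\log n)^2} \big) \]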



-f/-//((vec.4)^./("-"^)2 



since the entries of $A$ are less than $\epsilon n^{-1/2} \log n$, $c'$ does not depend on $\epsilon$ and we will choose $\epsilon$
to be sufficiently small. Recalling (3.21) we now have

We will simplify the above product using the values of $\lambda_{p,q}$ given in Lemma 6. First, the
contribution to the product from $1 \le p, q \le k-1$ is



p=l q=l 



p,q \ -^^'^ ^^p,q 



cn(k^-2k + 2)J J-lil ^\ 4cfc2(fc2-2A; + 2) 



exp 



cn(A-2 - 2k + 2) J ^ \ Ack^k^ -2k + 2) 

cn(A;2 _ 2A; + 2) J ^""^ 1 4cA;2(A;2 - 2A; + 2) 



where the last step used Lemmata 7(b) and 7(c). The contribution to the product when
exactly one of $p$ or $q$ equals $k$ is

\[ \left( \sqrt{\frac{\pi}{c n (k-1)(k-2)}} \right)^{2(k-1)}, \]

since Lemma 7(a) tells us that $\big( (\mathrm{vec} A)^T f^{(p,q)} \big)^2 = 0$ when $p = k$ or $q = k$. When $p = q = k$
the contribution to the product is

\[ \sqrt{\frac{\pi}{2 c n (k-1)^2}}. \]

Substituting these contributions into (3.22) we get

g{e)de 



-5,5]'''^ 

( -r, cf (veci) T (J, _ 1 +/, _ 1 ) '^'-VecA 



cn^ (k^ -2k + 2)(*^-i)V2 ((A; - l)(A; - 2))'=-' V2{k - 1) 

\ dn ) (A;2-2A; + 2)(^-i)V2((A;-l)(A;-2))'-'y2(A;-l) 
f2'n \ V^- ^) exp 2(fc2-2fc+2) ; 



— 



V^^^/ -2A; + 2)(^-i)V2(yt-2)^-i 

as required. ∎
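
The proof of Lemma 8 rests on the standard Gaussian integral $\int_{-\infty}^{\infty} e^{iax - bx^2}\, dx = \sqrt{\pi/b}\, \exp(-a^2/(4b))$ for $b > 0$. As a quick numerical sanity check, the sketch below compares a truncated trapezoidal approximation with the closed form. The truncation width and step count are arbitrary illustrative choices, and the function names are hypothetical.

```python
import cmath
import math

def gaussian_fourier_numeric(a, b, half_width=12.0, steps=24000):
    """Trapezoidal approximation of the integral of exp(i*a*x - b*x*x),
    truncated to the interval [-half_width, half_width]."""
    h = 2 * half_width / steps
    total = 0j
    for j in range(steps + 1):
        x = -half_width + j * h
        weight = 0.5 if j in (0, steps) else 1.0   # endpoint weights of the trapezoid rule
        total += weight * cmath.exp(1j * a * x - b * x * x)
    return total * h

def gaussian_fourier_closed_form(a, b):
    """sqrt(pi / b) * exp(-a^2 / (4 b)), valid for b > 0."""
    return math.sqrt(math.pi / b) * math.exp(-a * a / (4 * b))
```

The imaginary part of the numerical value vanishes by symmetry, and the real part agrees with the closed form to high accuracy because the integrand decays very fast.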



4 ... and for n not divisible by k

Define $k' = 2k$ if $dk$ is odd or $k' = k$ otherwise. Note that the conditions we assumed so far
(i.e. $n$ divisible by $k$ and $dn$ even) can be rewritten as simply $n \equiv 0 \pmod{k'}$. Therefore we
only need to consider the case $n \equiv r \pmod{k'}$ for each integer $r$ such that $0 < r < k'$ and
$dr$ is even. One possibility is to rework the whole argument of this paper but with slightly
unbalanced colourings. Instead, the asymmetry in the argument can be somewhat reduced
by using an argument relating different models of random regular graphs. We first treat the
case $n \equiv 0 \pmod{k'}$ in more depth, and prove the following.
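
The reduction to residues modulo $k'$ is elementary and can be confirmed by brute force. The sketch below is a small check, with hypothetical function names, that "n divisible by k and dn even" is the same as "n divisible by k'", and that for general n the parity of dn depends only on the residue of n modulo k'.

```python
def k_prime(d, k):
    """k' = 2k if d*k is odd, and k otherwise."""
    return 2 * k if (d * k) % 2 == 1 else k

def conditions_agree(d, k, n_max=400):
    """Brute-force check of both equivalences for 1 <= n <= n_max."""
    kp = k_prime(d, k)
    for n in range(1, n_max + 1):
        balanced_case = (n % k == 0) and (d * n) % 2 == 0
        if balanced_case != (n % kp == 0):
            return False
        # For general n, the parity of d*n is determined by r = n mod k'.
        r = n % kp
        if ((d * n) % 2 == 0) != ((d * r) % 2 == 0):
            return False
    return True
```

For instance, with d = 3 and k = 5 the modulus is k' = 10, and the admissible nonzero residues are the even ones.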

Theorem 10 Fix nonnegative integers $d \ge 3$, $k$ and $\ell$ such that $d < 2(k-1)\log(k-1)$.
Consider the $d$-regular graphs with $n$ vertices ($n$ divisible by $k$ and $dn$ even) and a distinguished
ordered set of $\ell$ edges whose endpoints induce a perfect matching (i.e. no two edges in the
distinguished set are adjacent to the same edge or incident with the same vertex). Let $G$ be
chosen uniformly at random from such structures. Then $G$ a.a.s. has a $k$-colouring in which
all $\ell$ distinguished edges have end vertices coloured 1 and 2.

Proof. Consider the probability space $\Omega_{n,d,\ell}$ with uniform probability distribution, and whose
underlying set consists of pairings in $\mathcal{P}_{n,d}$ with an ordered set $L$ of $\ell$ distinguished pairs of
points, such that no two pairs in $L$ are incident with the same vertex. Let $\widehat{Y}$ denote the
number of balanced $k$-colourings of a pairing containing $L$, in which the distinguished pairs
join vertices of colours 1 and 2. We will show that

\[ \mathbf{E}\widehat{Y} \sim \left( \frac{k(k-1)}{2} \right)^{-\ell} \mathbf{E}Y, \tag{4.1} \]

that (1.1) holds with $Y$ replaced by $\widehat{Y}$ (and no other adjustment), and that

\[ \mathbf{E}(\widehat{Y}^2) \sim \left( \frac{k(k-1)}{2} \right)^{-2\ell} \mathbf{E}(Y^2). \tag{4.2} \]

The theorem then follows immediately by the argument in the last few sentences of the proof
of Theorem 1 (for $n$ divisible by $k$).

To show (4.1) and the analogue of (1.1), we define for integers $r \ge 0$ and $p_1, \ldots, p_r \ge 0$,
$N(p_1, \ldots, p_r)$ to be the set of triples $(P, C, \Gamma)$ such that $P$ is a pairing in $\mathcal{P}_{n,d}$, $C$ is a balanced
$k$-colouring of $P$ and $\Gamma = (\Gamma_{i,j})_{1 \le i \le r,\; 1 \le j \le p_i}$ is an ordered $(p_1 + \cdots + p_r)$-tuple of different cycles in
$P$. In view of that, we can express

\[ \mathbf{E}(Y [X_1]_{p_1} \cdots [X_r]_{p_r}) = \frac{|N(p_1, \ldots, p_r)|}{|\mathcal{P}_{n,d}|} \quad \text{and} \]

\[ \mathbf{E}(\widehat{Y} [X_1]_{p_1} \cdots [X_r]_{p_r}) = \frac{1}{|\Omega_{n,d,\ell}|} \sum_{(P,C,\Gamma) \in N(p_1, \ldots, p_r)} h(P, C), \tag{4.3} \]

where $h(P, C)$ gives the number of choices of the ordered set $L$ of $\ell$ pairs in $P$ that have
the required colours at their ends. Almost all triples in $N(p_1, \ldots, p_r)$ correspond to pairings
with $dn/(k(k-1)) \pm O(n^{1/2} \log n)$ edges between each two colour classes, while the remaining
triples contribute with at most a $O(n^{-\Theta(\log n)})$ factor of the total. To verify this claim, observe
that for each fixed $C$ and $\Gamma$, the number of pairings $P \in \mathcal{P}_{n,d}$ compatible with $C$ and $\Gamma$ has a
factor $F = \sum 1 \big/ \prod_{1 \le i < j \le k} b_{i,j}!$, where $b_{i,j}$ denotes the number of edges between colour
classes $i$ and $j$ excluding the edges of the cycles in $\Gamma$ (see the computation of $\mathbf{E}Y$ and $\mathbf{E}(Y X_i)$
in Section 2). After using Stirling's formula to estimate the factorials in $F$, it is easy to check
that the main contribution to $F$ is due to terms with all $b_{i,j} = dn/(k(k-1)) \pm O(n^{1/2} \log n)$
and that the weight of the remaining terms is $O(F / n^{\Theta(\log n)})$ as required. Next, observe that
$h(P, C) \sim (dn/(k(k-1)))^{\ell}$ if $P$ has $dn/(k(k-1)) \pm O(n^{1/2} \log n)$ edges with endpoints coloured
1 and 2, and that $h(P, C)$ is always $O(n^{\ell})$. Therefore, we can estimate the sum in the right
side of (4.3) and combine it with $|\Omega_{n,d,\ell}| \sim |\mathcal{P}_{n,d}| (dn/2)^{\ell}$ to deduce

\[ \mathbf{E}(\widehat{Y} [X_1]_{p_1} \cdots [X_r]_{p_r}) \sim \left( \frac{k(k-1)}{2} \right)^{-\ell} \mathbf{E}(Y [X_1]_{p_1} \cdots [X_r]_{p_r}), \]

as required for (4.1) and the analogue of (1.1).

Similarly, to estimate $\mathbf{E}(\widehat{Y}^2)$ we define $N'$ to be the set of triples $(P, C_1, C_2)$ such that $P$
is a pairing in $\mathcal{P}_{n,d}$ and $C_1$, $C_2$ are balanced $k$-colourings of $P$, and write

\[ \mathbf{E}(Y^2) = \frac{|N'|}{|\mathcal{P}_{n,d}|} \quad \text{and} \quad \mathbf{E}(\widehat{Y}^2) = \frac{1}{|\Omega_{n,d,\ell}|} \sum_{(P, C_1, C_2) \in N'} h'(P, C_1, C_2), \tag{4.4} \]

where $h'(P, C_1, C_2)$ gives the number of choices of the ordered set $L$ of $\ell$ pairs in $P$ that have the
required colours at their ends in both colourings $C_1$ and $C_2$. Given $P \in \mathcal{P}_{n,d}$, let $b_{pqrs}$ denote
the number of edges between points coloured $(p, q)$ and points coloured $(r, s)$. We will show
that almost all triples in $N'$ correspond to pairings with $b_{pqrs} = dn/(k^2(k-1)^2) \pm O(n^{1/2} \log n)$,
and that the remaining triples are at most a $O(n^{-\Theta(\log n)})$ fraction of the total. Then (4.2)
follows immediately from (4.4), since $h'(P, C_1, C_2)$ is always $O(n^{\ell})$ and also $h'(P, C_1, C_2) \sim
(2dn/(k^2(k-1)^2))^{\ell}$ whenever $b_{1122} \sim b_{1221} \sim dn/(k^2(k-1)^2)$. To prove the remaining claim, we
first recall from the last lines of the proof of Proposition 2(b) that we can restrict our attention
to triples $(P, C_1, C_2)$ with colour count $M = [m_{p,q}]$ where $|m_{p,q} - 1/k| < \epsilon \log n / n^{1/2}$, since
all other triples contribute $O(|N'| / n^{\Theta(\log n)})$ to $|N'|$. Observe that while counting $|N'|$ we
encounter a factor $F' = \sum \prod 1/b_{pqrs}!$. We can easily bound the weight in $F'$
due to terms in which some $b_{pqrs}$ is not $dn/(k^2(k-1)^2) \pm O(n^{1/2} \log n)$, and find an extra
factor of $n^{-\Theta(\log n)}$ compared to the estimation of $F'$ in Lemma 8. ∎

Proof of Theorem 1 (for n not divisible by k). It only remains to show that if $d \ge 3$
and $d < 2(k-1)\log(k-1)$, then $\mathcal{G}_{n,d}$ is a.a.s. $k$-colourable for $n$ not divisible by $k$. We use
the type of argument employed at the end of Section 3 of [17].

Recall the definition of $k'$ in the beginning of the section, and let $r$ be any integer such that
$0 < r < k'$ and $dr$ is even. Suppose $n \equiv r \pmod{k'}$. Take a random $d$-regular graph $G$ with
$n$ vertices, and assume that the first $r$ vertices $v_1, v_2, \ldots, v_r$ are pairwise at distance at least 4. This
happens a.a.s. and we simply discard $G$ if this property fails to hold. Delete $v_1, v_2, \ldots, v_r$,
and join up the former $dr$ neighbours of these vertices by a random perfect matching, which
is added to what remains of graph $G$. Leave the $dr/2$ added edges as a distinguished ordered
set of edges (in any random order), and observe that none of these edges are adjacent to each
other by construction. It is easy to show and well known that a given vertex of a random
$d$-regular graph is a.a.s. not in a cycle of length less than 4 (or 100, for that matter). It follows
that a.a.s. no multiple edges occur due to the new edges. Throw the graph away if this last
property fails to hold. The result is a random $d$-regular graph on $n - r \equiv 0 \pmod{k'}$ vertices
with an ordered set of $dr/2$ distinguished edges, no two adjacent to the same edge or incident
with the same vertex. Let us call this $G'$.
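
The surgery that turns $G$ into $G'$ can be sketched in code. The following is an illustrative implementation under stated assumptions, not the authors' procedure verbatim: the graph is a dictionary from vertices to neighbour sets, the uniform perfect matching is drawn by shuffling the list of former neighbours and pairing consecutive entries, and `None` is returned whenever the text says to discard the graph. The prism graph is only a toy input; all names are hypothetical.

```python
import random
from collections import deque

def distances_from(adj, source):
    """Breadth-first search distances from `source`."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist

def pairwise_far(adj, vertices, min_dist=4):
    """True if all listed vertices are pairwise at distance >= min_dist."""
    for i, u in enumerate(vertices):
        dist = distances_from(adj, u)
        for v in vertices[i + 1:]:
            if dist.get(v, float("inf")) < min_dist:
                return False
    return True

def surgery(adj, removed, rng):
    """Delete `removed` and rematch their former neighbours uniformly at random.
    Returns (new_adj, distinguished_edges), or None if the graph is discarded."""
    if not pairwise_far(adj, removed):
        return None
    gone = set(removed)
    stubs = [w for v in removed for w in sorted(adj[v])]   # the d*r former neighbours
    if len(stubs) % 2 == 1:
        raise ValueError("d*r must be even")
    new_adj = {u: set(nb) - gone for u, nb in adj.items() if u not in gone}
    rng.shuffle(stubs)                                     # shuffle, then pair consecutive entries
    pairs = list(zip(stubs[0::2], stubs[1::2]))
    for a, b in pairs:
        if b in new_adj[a]:                                # would create a multiple edge
            return None
        new_adj[a].add(b)
        new_adj[b].add(a)
    rng.shuffle(pairs)                                     # random order of distinguished edges
    return new_adj, pairs

def prism(m):
    """3-regular prism graph on 2m vertices, used only as a toy input."""
    adj = {(side, i): set() for side in (0, 1) for i in range(m)}
    for i in range(m):
        for side in (0, 1):
            adj[(side, i)].add((side, (i + 1) % m))
            adj[(side, (i + 1) % m)].add((side, i))
        adj[(0, i)].add((1, i))
        adj[(1, i)].add((0, i))
    return adj
```

For example, `surgery(prism(10), [(0, 0), (0, 5)], random.Random(1))` removes two vertices at distance 5 and returns a 3-regular graph on 18 vertices together with 3 distinguished edges. The sketch does not verify the uniformity claim, only the regularity of the output.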

The distribution of $G'$ is uniform, since for each $G'$ the number of ways of reinstating the
edges to $v_1, v_2, \ldots, v_r$ is exactly $\binom{dr}{d, \ldots, d}$. Thus, by Theorem 10, $G'$ a.a.s. has a $k$-colouring
such that the $dr/2$ distinguished edges join vertices of colours 1 and 2. We can use exactly
this colouring on $V(G) \setminus \{v_1, v_2, \ldots, v_r\}$, and colour $v_1, v_2, \ldots, v_r$ with colour 3 to obtain a
$k$-colouring of $G$. (Note that $k \ge 3$ from our assumptions on $d$ and $k$.) ∎
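
The final recolouring step works because every neighbour of a deleted vertex is an endpoint of a distinguished edge, hence coloured 1 or 2, and the deleted vertices are pairwise non-adjacent. A minimal sketch, with hypothetical names and a toy 5-cycle as input:

```python
def is_proper(adj, colouring):
    """True if no edge of the graph joins two vertices of the same colour."""
    return all(colouring[u] != colouring[v] for u in adj for v in adj[u])

def extend_colouring(colouring_g_prime, removed, extra_colour=3):
    """Reuse the colouring of G' and give every reinstated vertex colour 3."""
    colouring = dict(colouring_g_prime)
    for v in removed:
        colouring[v] = extra_colour
    return colouring

# Toy example: G is the 5-cycle 0-1-3-4-2-0; deleting vertex 0 and joining its
# former neighbours 1 and 2 by a distinguished edge gives the 4-cycle G'.
g = {0: {1, 2}, 1: {0, 3}, 2: {0, 4}, 3: {1, 4}, 4: {3, 2}}
g_prime = {1: {3, 2}, 2: {4, 1}, 3: {1, 4}, 4: {3, 2}}
colouring_g_prime = {1: 1, 2: 2, 3: 2, 4: 1}   # distinguished edge {1, 2} gets colours 1 and 2
colouring_g = extend_colouring(colouring_g_prime, [0])
```

Here `colouring_g` is a proper colouring of the 5-cycle with three colours, obtained without changing any colour used on G'.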

References 

[1] M. Abramowitz and I. A. Stegun, editors. Handbook of Mathematical Functions with
Formulas, Graphs, and Mathematical Tables, volume 55 of National Bureau of Standards
Applied Mathematics Series. U.S. Government Printing Office, 1972. 10th printing.

[2] D. Achlioptas and C. Moore. The chromatic number of random regular graphs. In 
Proc. 7th International Workshop on Approximation Algorithms for Combinatorial Opti- 
mization Problems and 8th International Workshop on Randomization and Computation, 
volume 3122 of Lect. Notes Comput. Sc., pages 219-228. Springer, 2004. 

[3] D. Achlioptas and A. Naor. The two possible values of the chromatic number of a random 
graph. Ann. Math., 162(3):1333-1349, 2005. 

[4] N. Alon and M. Krivelevich. The concentration of the chromatic number of random 
graphs. Combinatorica, 17(3):303-313, 1997. 






[5] S. Ben-Shimon and M. Krivelevich. Random regular graphs of non-constant degree: 
Concentration of the chromatic number. Discrete Math., To appear. 

[6] B. Bollobas. A probabilistic proof of an asymptotic formula for the number of labelled
regular graphs. Eur. J. Combin., 1(4):311-316, 1980.

[7] B. Bollobas. The chromatic number of random graphs. Combinatorica, 8(1):49-55, 1988.

[8] A. Coja-Oghlan, K. Panagiotou, and A. Steger. On the chromatic number of random 
graphs. J. Comb. Theory B, 98(5):980-993, 2008. 

[9] C. Cooper, A. M. Frieze, B. A. Reed, and O. Riordan. Random regular graphs of 
non-constant degree: Independence and chromatic number. Comb. Probab. Comput., 
11(4):323-341, 2002. 

[10] N. de Bruijn. Asymptotic methods in analysis. North Holland Publishing Co., third 
edition, 1970. 

[11] A. M. Frieze and T. Luczak. On the independence and chromatic numbers of random
regular graphs. J. Comb. Theory B, 54(1):123-132, 1992.

[12] S. Janson, T. Luczak, and A. Ruciński. Random Graphs. John Wiley, 2000.

[13] M. Krivelevich, B. Sudakov, V. H. Vu, and N. C. Wormald. Random regular graphs of
high degree. Random Struct. Algor., 18(4):346-363, 2001.

[14] T. Luczak. The chromatic number of random graphs. Combinatorica, 11(1):45-54, 1991.

[15] T. Luczak. A note on the sharp concentration of the chromatic number of random graphs.
Combinatorica, 11(3):295-297, 1991.

[16] M. Molloy and B. A. Reed. The chromatic number of sparse random graphs. Master's
thesis, University of Waterloo, 1992.

[17] R. W. Robinson and N. C. Wormald. Almost all regular graphs are hamiltonian. Random 
Struct. Algor., 5(2):363-374, 1994. 

[18] E. Shamir and J. Spencer. Sharp concentration of the chromatic number on random
graphs $G_{n,p}$. Combinatorica, 7(1):121-129, 1987.

[19] L. Shi and N. C. Wormald. Colouring random 4-regular graphs. Comb. Probab. Comput.,
16(2):309-344, 2007.

[20] L. Shi and N. C. Wormald. Colouring random regular graphs. Comb. Probab. Comput.,
16(3):459-494, 2007.

[21] N. C. Wormald. Models of random regular graphs. In J. D. Lamb and D. A. Preece, 
editors, Surveys in Combinatorics 1999, volume 267 of LMS Lecture Note Series, pages 
239-298. Cambridge University Press, 1999. 






